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THE PARETO-LEVY LAW AND THE DISTRIBUTION OF INCOME* 
By BENOIT MANDELBROT’ 
SUMMARY 


THE PURPOSE of this paper is twofold. On the one hand, we wish 
to give an account of a set of new models for certain distribution 
properties of an important class of economic quantities, which includes 
‘‘income’’ (see [15], [17]). On the other hand, we think that the tools 
which we shall use are as important as the results which we hope to 
achieve: that is, we intend to draw the economist’s attention to the 
great potentital importance of ‘‘stable non-Gaussian’’ probability dis- 
tributions. To give a sharper focus to our paper, we shall develop 
our main points within the frame of a theory of income distribution; 
but our approach will be immediately translatable in terms of similar 
quantities, and our theory may well be more reasonable, or the em- 
pirical fit better, in the case of some other quantities. We may thus 
paraphrase what a famous author said of Brownian motion: it is possible 
that the properties which we sha]] study are identical to those of income; 
however, the information available to us regarding incomes is so lacking 
in precision that we cannot really form a judgment on the matter. 

This paper will be almost exclusively theoretical. Our point of de- 
parture will be an interesting empirical observation, namely, that over 
a certain range of values of income U, its distribution is not markedly 
influenced either by the socio-economic structure of the community 
under study, or by the definition chosen for ‘‘income.’’ That is, these 
two elements may at most influence the values taken by certain param- 
eters of an apparently universal distribution law. This law was dis- 
covered by V. Pareto in 1897 [24]; actually, several different statements 
have at times been called ‘‘Pareto’s law.’’ In the introduction we shall 
very carefully distinguish between these statements, and we shall 
comment upon some existing theories of income distribution. 


* Manuscript received June 19, 1959. 

1 The work in this paper was started while the author was associated with the Uni- 
versity of Geneva (under partial support of the Rockefeller Foundation), then with the 
University of Lille and the Ecole Polytechnique in Paris. 
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We shall then introduce a new version of Pareto’s law which be- 
longs to the class of ‘‘stable’’ distributions, and which we shall designate 
as ‘‘the Pareto-Lévy law’’ (P-L).’ This law also explains partially the 
data relative to the middle-income range. 

The initial motivation for this law will be essentially twofold. First, 
among all the possible interpolations of the weak (asymptotic) Pareto 
law, the P-L law is the only one which strictly satisfies one (strong) 
form of the condition of invariance of the distribution, relative to the 
definition of income. Second, a P-L distribution is a possible limit 
distribution of sums of random variables. That is, as we shall demon- 
strate, the Gaussian distribution is not the only possible such limit (as 
is too generally assumed), and it is unnecessary (as well as insufficient) 
to try to save the limit argument by applying it to log U, instead of 
U, as is done in some theories leading to the ‘‘log-normal law’’ for U. 

The P-L law turns out to have many other very desirable proper- 
ties. Jt has also the drawback that it is limited to the interpolation 
of Pareto’s law, when its index a is such that 1 < a < 2, (see 1.1 for 
a definition of a). 

In a series of other articles, which has started with [18], [19], [20], 
we also study the P-L stochastic processes, a family of models of 
income variation. These processes again preserve certain desirable 
properties of the classical Gaussian processes without such drastic de- 
fects as the fact that income variation cannot possibly be Gaussian. 
It turns out that in the high income range, a P-L Markovian pro- 
cess can be approximated by a random walk of log U: that is, we 
are able to derive the fundamental initial assumptions of a model 
due to Champernowne [2]. Moreover, a P-L Markovian process be- 
haves quite reasonably in the intermediate range of incomes. 


1. INTRODUCTION 


1.1. The strong Pareto law. Let us begin with a basic variant, 
which will be referred to as the strong Pareto law. Let P(u) be 
the percentage of individuals with an income U (over some fixed period 
of reference) exceeding some number u (u is assumed to be a con- 
tinuous variable). The strong Pareto law asserts that: 


(uju’)-* when u>w 


P = 
(u) 1 when u <u’. 


2 This associates Pareto’s name with that of Paul Lévy, who first studied the stable dis- 
tributions intensively. (See in particular Lévy [10], [11], [12].) To the best of our knowl- 
edge the application of these laws to economics is entirely new as well as are the definition 
and properties of the ‘‘Pareto-Lévy”’ processes of [18], [19], [20] 
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Then, the ‘‘density’’ p(u) = — dP(u)/dw satisfies 


(a(u’)*u-er? when u> wu’ 
pu) = lo : 

( when u< uw’. 
This distribution is fully specified by two ‘‘state variables’’: u°, a scale 
factor, and a, which will turn out to be some kind of index of ine- 
quality of distribution. The strong law leaves the value of a undeter- 
mined (although in some cases, one immediately states that a> 1). 
We may also find the stronger statement that a = 3/2: the corresponding 
variant of the strong law will be referred to as the strongest Pareto 
law. -Graphically, the strong law implies that the (‘‘double logarithmic’) 
graph of y = log P, as a function of v = logu, is a straight line. 

The empirically observed P(u) are of course percentages relative to 
finite populations. They will further be considered as frequencies rel- 
ative to a random sample, drawn from an infinite population. [Em- 
pirically, the law holds as well for samples of a few hundred (the 
‘‘burghers’”’ of a city-state of the Renaissance) or of close to a hundred 
million (the taxpayers in the USA).] That is, U will be treated as 
a random variable with values u, and the curve describing the varia- 
tion of U in time will be treated as a random function. 

1.2. The weak Pareto law. The strongest Pareto law is now 
acknowledged to be empirically unjustified (and it should be noted that 
many purported ‘‘disproofs’’ of ‘‘the’’ Pareto law apply to this variant 
only). On the contrary, there is little question of the validity of the 
Pareto law if ‘‘sufficiently’’ large values of u are concerned. The best 
way of taking care of the limitations of the strong law is simply to 
state that: 


P(u) ‘‘behaves like’’ (u/u°)-*, as uo. 


This is the weak Pareto law. Such a statement is useful only if 
the exact definition of ‘‘behaves like’’ conforms to the empirical evi- 
dence and (taking advantage of the indeterminacy of such evidence) 
lends itself to easy mathematical manipulation. The following defini- 
tion of ‘‘behaves like’’ is usually found to be adequate: 

P(u) ~ (ujw’)? . 
That is: 
_P(u) —1l1,asu-o, 
(u/u’)-* 
or, 
P(u) = {1 + e(u)}(u/u’)-*, where e(u)— 0, as uo, 
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Graphically, this means that the curve (log P, logu) is asymptotic to 
the straight line which represents the strong Pareto law. 

1.3. Interpolation of the weak Pareto law. The range of values 
of u which is accounted for by Pareto’s analytic expression covers 
only a part of the total population. Elsewhere the ‘‘density’’ —dP(u)/du 
is represented by a quite irregular curve, the shape of which depends 
in particular upon the breadth of coverage of the data considered. 
This has been particularly clearly emphasized by H. P. Miller [22]. 
He has shown that the income distribution is skewed and presents 
several maxima, if one includes all individuals, even those with no 
income and part-time workers, and if one combines the incomes of 
men and women. However, the separate income distributions of most 
of the different occupational categories, as distinguished by the census, 
are both regular and fairly symmetric; the main source of skewness 
in the overall distribution and hence in Pareto’s law can be traced to 
the inclusion of self-employed persons and managers together with all 
wage earners. One may also note that the method of reporting in- 
come differs by occupational categories, and that, as a result, the cor- 
responding data are not equally reliable. 

The above reasons make it unlikely that a single theory could ever 
explain all the features of the income distribution or that a single 
empirical formula could ever represent all the data. As a result, it 
has been frequently suggested that several different models may be 
required to explain the empirical P(u); unfortunately, we know of no 
empirical check of this conjecture. In any case, the present paper 
will be devoted almost exclusively to a theory of high income data 
and of the weak Pareto law. It is unlikely that the interpolation of 
the results of our model will be able to explain finally all the middle 
income data. However, we shall not examine this point in great 
Cetail. 

1.4. Two laws of income distribution which are in contradiction 
with the weak Pareto law. 

A. Pareto himself has suggested the following possible improvement 
in his law: 


p(u) = — dP/du = ku-“*"exp(— bu) (where b>0). 


However, the weak Pareto law must be at least approximately cor- 
rect for large u. Hence, the parameter 6 must be very small, and 
the evidence against the hypothesis that ) = 0 is not strong. The 
choice, therefore, between the hypotheses 6=0 and b+0 may be 
legitimately influenced by ease of mathematical manipulation and ex- 
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planation. We shall see that the weak Pareto law (6 =0) has some 
crucial properties, which will be used as a basis for a theory, and 
which disappear if 6 +0. We shall therefore disregard the case b + 0 
(the more so since the formula ‘above provides no improvement for 
middle values of uw). (See 2.4 for a further discussion of 6 = 0.) 

B. The best known statement about income distribution, apart from 
the weak Pareto law, is the log-normal law [8], [1], which claims that 
the variable log U [or perhaps better log(U—u’), where w’>0] is well 
represented by the Gaussian distribution. The empirical evidence for 
this is that the graph of (P, ~) on log-normal paper seems to be straight. 
Such a graph emphasizes a different range of values of u from that 
of the Pareto graph, so that the graphical evidence for the two laws 
is not contradictory. The motivation for the log-normal law is, how- 
ever, largely theoretical (see 1.5). Roughly speaking, a log-normal U 
can be explained by assuming that log U is the sum of many additive 
components. 

1.5. Some existing models of income distribution considered as 
“‘thermodynamic’”’ theories. There is a great temptation to consider 
the exchanges of money which occur in economic interaction as analo- 
gous to the exchanges of energy which occur in physical shocks between 
gas molecules. In the loosest possible terms, both kinds of interactions 
‘“‘should’’ lead to ‘‘similar’’ states of equilibrium. That is, one ‘‘should’’ 
be able to explain the law of income distribution by a model similar 
to that used in statistical thermodynamics: many authors have done 
so explicitly, and all the others of whom we know have done so im- 
plicitly. 

Unfortunately, the Pareto P(u) decreases much more slowly than 
any of the usual laws of physics, so that if one wants to apply the 
physical theory mechanically, one must somehow argue that U is a less 
intrinsic variable than some slowly increasing function V(U). For 
that, one must renounce the additive properties of U. The seemingly 
universal choice for V is V = log U [or V’ = log(U — w’)]. This choice 
is suggested by the fact that one plots lou empirically. But it can 
also be traced back to the ‘‘moral wealth’’ of Berpouilli, and it ap- 
parently can be justified by some law of proportionate effect, or by 
some kind of Weber-Fechner law. If this choice of V is granted, one 
has to explain the normal law for the middle zone of v’s, and the 
exponential law P(v) = exp {— a(v — v’)} for the upper zone of v’s. 

Indeed, many existing models of the Pareto distribution are reduci- 
ble to the observation that (for any a) exp(— av) is a possible bar- 
metric density distribution in the atmosphere. Alternatively, con- 
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sider a set of Brownian particles, floating in a gas at a uniform 
temperature and density, the whole being enclosed in a semi-infinite 
tube with a closed bottom and an open top; assume further that the 
field of gravity is uniform. Then the density distribution which the 
Brownian particles attain as a state of final equilibrium is the ex- 
ponential. This is the result of a compromise between two forces, both 
uniform along’ she tube, i.e., the force of gravity, which alene would 
tend to pull ail particles to the bottom, and the influence sf beat 
motion, which alone would tend to diffuse them all to infinity. Clearly, 
the models of income distribution we are now considering involve an 
interpretation of the forces of diffusion and of gravity. 

Unfortunately, such interpretations are never sufficiently intuitive 
to exclude, for example, the removal of the counterpart of the bot- 
tom of the tube. The trouble is that such a removal changes every- 
thing. In fact, there is no steady limit state any longer, because all 
the Brownian particles diffuse to minus infinity. It is true that the 
Gaussian distribution does appear as a conditional distribution for V 
(this distribution is valid if one assumes that all the particles start 
from the same point), but this cannot really be a basis for a model 
of the log-normal law. 

This difficulty is already very clear in the simplest model of a dif- 
fusion, the so-called random walk, in which both time and V are 
quantified. Time is integral and V is a multiple of a unit v”’. This 
model (apparently introduced into economics by Champernowne [2]) fur- 
ther assumes (1) that the variation of V is Markovian, that is, V(t+1) 
is a function only of V(t) and of chance, and (2) that the probability 
that V(t + 1) — v(t) = kv” becomes independent of v(t) as v increases. 
Additional assumptions are required to get either Pareto’s law or the 
log-normal law, and the difference between these additional assump- 
tions has no intuitive meaning, which makes both conclusions uncon- 
vincing. However, the models which lead to the exponential are still 
slightly better: in fact, we can, perhaps, argue that the apparent 
normality of the ‘‘density’’ p(v) in the central zone of w’s simply 
means that — log p(v) may be represented by a parabola in that zone, 
whereas for large v’s, it is represented by a straight line. Such a 
parabolic interpolation needs no limit theorems of probability for its 
justification; it applies to any regularly concave curve, at Izast in the 
first approximation. 

In the case of models of the weak Pareto law, further non-intuitive 
assumptions are necessary to rationalize the empirical fact that a > 1 
in all cases. 
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We shall not analyze any of the other existing translations into 
economic terms of various models of the normal or exponential distri- 
butions, which occur in statistical thermodynamics, such as gas equilib- 
rium or Brownian motion. Because of their common ‘“‘thermodynamic’”’ 
character, these theories are essentially equivalent. In particular, 
they do not assume any institutio: al income structure; all income re- 
cipients are treated as entrepreneurs. A rewording of the classical 
theories, however, can make them applicable to many possible institu- 
tional structures of wage and salary earners; see Lydall [13]. 

In this article (see also [18]) we shall attempt to show that the 
analogy of statistical physics need not be abandoned, if we want to 
avoid the weaknesses which mar existing theories. We shall not re- 
quire the transformation V = log U, and we shall not try to justify 
the introduction of the economic counterparts of such conditions as 
the presence or the absence of a bottom to the tube in which Brownian 
motion is studied. That is, we shall not try to force income into the 
structure of statistical thermodynamics (even implicitly); rather, we 
shall attempt explicitly to generalize the statistical methods of thermo- 
dynamics, in order to cover the economic concept of income. 

We shall be successful only in the case where 1 < a@ < 2, a restriction 
which we shall discuss repeatedly. 


2. PARETO-LEVY RANDOM VARIABLES 


2.1. An analysis of the definition of income. One of the principal 
claims of this paper is that it is impossible to ‘‘explain’’ why Pareto’s 
law, and not some other law, is satisfied by income distributions, with- 
cut also studying the inseparable problem raised by the fact that 
essentially the same law continues to be followed by the distribution 
of ‘‘income,”’ despite changes in the definition of this term. Such an 
invariance is of course very important to census analyzers, because 
of the small effect of large changes in methods of estimating income. 
To study this problem one needs a practical way of expressing the 
different definitions of U. We shall argue that there are several 
ways oi distinguishing different sources of U. Therefore, U may be 
written in different ways as a sum of elements, such as: A, agricult- 


* We had indenpendently rediscovered this model and had discussed it in the original 
text of this paper, 2° submitted on June 19, 1959. The coincidence of the predictions 
of the P-L model and of the Lydall model for 1<a<2, shows that the distribution 
which corresponds to the least amount of organization could be “‘frozen’’ without modi- 
fying it, reinterprete’ as 2 wage distribution and then let to evolve along conceptually 
quite different lines. This fact has great relevance to the problem of the value of a 
and to the fact that a has recently tended to increase beyond 2, in highly organized 
industrial societies. 
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ural, commercial, or industrial incomes; B, incomes in cash or in kind; 
C, ordinary taxable income or capital gains; D, incomes of different 
members of a single taxpaying unit, etc... Let the income categories 
in a certain decomposition of U be labeled as U, (1<i<N). We shall 
assume that every method of reporting or estimating income corre- 
sponds to the observation of the sum of the U, corresponding to some 
subset of indices i. This quite reasonable consideration imposes the 
restriction that the scale of incomes themselves is more intrinsic than 
any transformed variables, such as log U.* 

Let the incomes in the categories U, be considered as statistically 
independent. This assumption idealizes the actual situation. A priori, 
this abstraction may seem a bad first approximation, since when in- 
come is divided into two categories U’ and U”, such as agricultural 
and industrial incomes, the observed values wu’ and w” are usually very 
different. We shal! find, however, that in the weak Pareto case, when 
the parts are independent, one would actually expect them to be very 
unequal; hence, although such an inequality cannot be a confirmation 
of independence, at least it does not contradict it in this case. 

Under these assumptions, the only probability law for income which 
could possibly be observed must be such that if U’ and U” follow 
this law (up to a scale transformation, and up to the choice of origin), 
then U'@U” must also follow the same law (the sign @ designates 
the addition of random variables). That is, whatever a’>0, b’, a” >0, b”, 
there exist two constants a > 0 and 5, such that 


(v’U +b) @(e"U + b”") =aU +b. 


Such a probability law is said to be ‘‘stable’’ (under addition), aad 
its density as well as the variable U itself are ‘‘stable.’”” The family 
of all laws which satisfy this requirement has been constructed by 
Paul Lévy [10]: it includes, naturally, the Gaussian law, but it also 
includes certain non-Gaussian laws, each of which turns out to satisfy 
the weak Pareto law, with some 0 < a < 2. In other words, the addi- 
tivity property of U, and the behavior of P(u), under the weak Pareto 
law (both of which disappear if the scale of U is changed) turn out 
to be precisely sufficient and necessary for the application of Lévy’s 
theory of stable laws. 

Further, the parameters of a non-Gaussian stable law can be ar- 
ranged so that E(U)< o, which requires 1 < a< 2, and so that 
~(— u) decreases very much faster than p(u), when w— co: in the 


* The same kind of division may also be performed in the direction of time. The prob- 
lem arises that it is unlikely that the year be an intrinsic unit of time. If it is not, how 
can a single law hold irrespective of the time unit chosen? See [18], [19], [29]. 
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latter case, the stable law may be called positive, an abbreviation for 
‘‘maximally skewed in the positive direction.’’ 

Since the usual terminology of probability (stable distributions) does 
not serve our purpose, we propose to refer to ‘‘positive’’ stable dis- 
tributions with 1 < a < 2, as ‘‘Pareto-Lévy distributions.”’ 

It is not strictly true that the distribution of income is invariant 
over the whole range of wu, with respect to a change in definition of 
income. We argue, however, that if the variable U is not Gaussian 
and is very skewed, the most reasonable ‘‘first order’’ assumption 
about income is that it is a P-L variable. 

The density p(u) of the P-L law unfortunately cannot be expressed 
in a closed analytic form, but is determined by its bilateral Laplace 
transform (valid for b > 0): 


G(b) = \" exp(— bu)|dP(u)| = \" exp(— bu)p(u)du 
= exp {(bu*)* + Mb}, 


which depends upon three parameters: a (1 < a < 2), u* (which is a 
positive scale parameter), and M (which is identical to E(U)). 
G(b) yields the result: 


Pluyrwu(wTl—a foru-o, 


where /‘(1 — @) is the Euler gamma-function. 

For other values of u, the Pareto-Lévy distribution had to be com- 
puted. A few sample graphs of densities appear in Figure 1. More 
detailed tables will be published by Mandelbrot and Zarnfalier [21]. 

We see that as long as a@ is not close to 2, the P-L density curve 
very rapidly becomes indistinguishable from a strong Pareto curve of 
the same @. [For this, the origin of the strong Pareto curve must 
be chosen properly and one must have u’ = u*J(1—a).] The two 
curves converge near u = E(U), when a is in the neighborhood of 3/2, 
and at even smaller values of u, when a is less than 3/2: that is, the 
asymptotic behavior of P(u), derived from G(b), is very rapidly im- 
plemented. As for large negative values of u, by the P-L law they 
have a probability which is not zero, but decreases so rapidly as u- — &, 
that they may be safely disregarded; one finds that 


a 





log [— log p(u)}] ~ 


log |u|. 
7 oe lul 


This is a faster decrease than in the case of the Gaussian density: 
see Appendix lI. 
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Finally, near E(U) and the most likely value of U, the P-L curve 
has the kind of skewness which one finds in the empirical data and 
which one hopes to derive in a theoretical curve. 

Let a now approach 2. At the limit, the P-L density will tend 
toward a Gaussian density. Close to the limit, the P-L density already 
resembles a Gaussian one. Only for large values of u, (which have 
a very small probability) is the Gaussian decrease of P(u) replaced by 
a Paretian decrease. 

In other terms, the P-L density is worth considering only if a is 
not ‘‘too close’’ to 2. Even in this range, however, the prediction for 
intermediate values of u is quite inadequate to cover the complete 
curve of incomes, as described for example by Miller [22]. But as 
part-time and unskilled workers are eliminated we more nearly approx- 
imate such a curve. On the other hand, the income distribution of 
unskilled workers is fairly symmetric and has a fairly small dispersion.° 
Since the distribution of unskilled wages differs from that of high 
incomes, the P-L law would a priori explain only the latter incomes. 
If we examine any income category as defined by the Bureau of Census, 
we can seldom tell in advance whether its income mechanism is closer 
to that of the unskilled workers or to that of the largest income re- 
cipients. Therefore, if we wish to determine how wide are the cate- 
gories to which the P-L law applies, the only solution is to start from 
the P-L interpolation of high incomes, and then to see how many 
other incomes (and which) must be added to obtain a good fit. This 
establishes a distinction between two kinds of income categories. The 
reasonableness of this distinction should be checked on independent 
grounds. For the time being we shall be content to observe that for 
a far enough from 2 the prediction made by the P-L law is ‘‘reasona- 
ble.”” (cf. 2.6.) 

We s° ‘| conclude this section with a few statemerts concerning 
non-P-L stable distributions. The Gaussian distribution is of course 
stable, and it is the only stable distribution with a finite variance. 
Further, all the stable variables with a finite mean are differences of 
P-L variables scaled by arbitrary positive coefficients. The bilateral 
generating function is no longer defined, and we must consider the 
usual characteristic function (c.f.). For a P-L variable, the c.f. is im- 
mediately obtained as 
w(t) = - et n(u)du = exp| — iMg —|¢|*(u*{1— te F cos |. 


5 We may even think that their incomes would become non-random if their occupations 
were entirely interchangeable and if their changes in type of employment were without 
obstruction and costless. 
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Hence, denoting by (1 + 8)/(1— 8) the ratio of the positive and negative 
components, the general stable variable, with E(U) < «, hasac.f. of 
the form: 


ott) = exe] — Me — Ig uty {1 — BE te hos] 


where u* >0,/8/<1andl<ac< 2. 

(We may note that the same formula, with 0 < a <1, gives another 
family of stable variables, with E(U) = co. The stable variables which 
do not fall into either of the above families are those of Gauss and 
of Cauchy and finally a small family of variables connected with those 
of Cauchy.) 

2.2. Another (equivalent) property of the stable laws. The stable 
laws have another (equivalent) property: they are the only possible 
limit laws of weighted sums of identical and independent random 
variables. That is, if U is decomposed into a sum of a large number 
of components U,, we need not resort to the above argument of ob- 
servational invariance; we know that (1) if the sum is not Gaussian 
(which is a conspicuous fact), (2) if the expected value of the sum is 
finite (which is also a fact), and (3) if the probability of —w is much 
less than that of uw, for u > 0 and large, then no hypothesis about the 
distribution of the parts is necessary to conclude that the sum can 
only be a Pareto-Lévy variable. We shall develop these points in 
more detail, and in 2.6 we shall show that the above restrictions may 
be reduced to broad ‘‘qualitative’’ properties, such as the likelihood 
that two independent variables U’ and U” contribute unequally to the 
sum U’@ U". 


® Every stable law is such a limit. To prove this, assume that the variables U; them- 

selves are stable; by induction of stability, the @-sum LliLUs will be a variable 
of the form a(N)U +0(N) and hence {a(N)}- “(5 %,U;) — (N)} will have the same 
distribution function as U. Conversely, assume that a certain normed sum of the variables 
U; has a limit. Write the N-th normed sum, taken in the sense of @, as 


A(N) ¥U, - BON) 


AW) 


_ AW) 
A(m) 


“x {acm EU. - Bn) @ aay 


{Atm 5 FU, - Bm) } + CIN) 


where N=n+m. By letting both nm and m tend to infinity, we find that the definition 
of a stable variable must be satisfied by the limit of AN) E2210; — BIN) + C(N). 

We should also mention the following theorem of Gnedenko. For the convergence of 
N~“* 57,1; - BUN) to a stable limit, it is necessary and sufficient that 0 <a <2, 
and that foru > 0, F(u)=1— P(u)=1 — {1 + e’(u)}(u/u)-*, where e/(u) » 0, as u > o, 
and for u <0, F(u) = {1 + e’"(u)}(u/us)-*, where e’’(u) +0, as u > —o. 
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It should be noted that an expiznation of the distribution of income 
in terms of the addition of many contributions does not involve the 
log U transformation. This avoids the usual limit argument, which 
leads to the log-normal law and contradicts Pareto’s law. 

These results may be better than expected because the limit distribu- 
tion might not hold for U, which is a sum of only a few random com- 
ponents. That is. if one wants to increase the number of components 
of U, one must abandon the hypothesis of independence at some point: 
the more U is subdivided the less independent the components become. 
This is a difficulty which is not proper to the problem of income, but 
is acutely present in all social science applications of probability theory.’ 
Of course, the principle of the difficulty appears in physics for example 
in explaining why some kind of noise is Gaussian. But in most physi- 
cal problems there exists a sufficiently large zone between the systems 
which are so small that they are impossible to subdivide and those which 
are so large that they can no longer be considered as homogeneous. 
No such zone exists in most problems of economics, so that a seemingly 
successful application of a limit theorem may seem too good to be true. 

To sum up, it cannot be strictly true that the additive components 
are independent and have the same distribution (up to scale). However, 
the ‘‘likelihood’’ of any distribution is greatly increased if this is the 
only one reducible to limit arguments. Hence, if a sum of many 
components is not Gaussian, is skewed, and such that E(U) < ~, 
then the most reasonable first assumption concerning the sum is that 
it follows the Pareto-Lévy law. 

More precisely, two extreme interpretations are possible. The ‘‘mini- 
mal’’ interpretation regards the Pareto-Lévy distribution as just an- 
other law which is correct asymptotically, is sufficiently easy to handle, 
and is useful in the first approximation (after all, the Gaussian it- 
self is frequently a good first approximation to distributions which, 
actually, are certainly not Gaussian). 

At the other extreme we may take the Pareto-Lévy law entirely 
seriously, and try to check its ability to predict some properties of 
income distribution which otherwise would seem independent of the 
weak Pareto law. We think that such predictions were in fact achieved. 
This provides some claim for a ‘‘maximal’’ interpretation of the P-L 
‘aw, which regards its invariance and limit properties as being ‘‘ex- 
plicative.’’ 

2.3. Stable laws are very well known in abstract probability theory 


7 In estimating a mean by an empirical average, often the sample is too small or the 
many available data are strongly correlated. 
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(cf. Lévy [10], [11], [12], Gnedenko and Kolmogoroff [8}j). We might, 
therefore, have introduced the P-L law without special motivation 
other than the fact that the known asymptotic behavior of P(u) and 
its recently computed behavior for intermediate u make it an attrac- 
tive interpolation for the income data (Mandelbrot [18]). Unfortunately, 
the behavior of the P-L law is in many other ways quite different 
from that to which most statisticians are accustomed from their con- 
stant handling of the Gaussian law. The only way of really showing 
how far this law is suited to the present problem is to sketch its 
theory, starting with a heuristic introduction to its properties (from 
the viewpoint of addition and of the study of extremal values), and 
ending with some rigorous results. 

We see how the three main drawbacks of the classical theories 
vanish. Many variants of the same argument ali lead to the same 
result; no change of scale of U is necessary, the behavior of P(u) 
being exactly what is needed, and a is ‘‘near’’ 3/2. In fact, a should 
not be too close to 2, and cannot be greater than 2, a point which 
requires some elaboration. 

2.4. Concerning the sign of a—2 and the variance of U. A few 
empirical data claim to lead to a >2. To explain this by a limit argu- 
ment requires most specific and unlikely assumptions. Similarly, the 
observational invariance of the stable distribution cannot hold if a > 2, 
unless it is weakened (2.5) to require that the sum of a few weak 
Pareto variables with a > 2 is a weak Pareto variable with a > 2; as 
the number of addends increase (2.8), the weak Pareto law holds for 
a decreasing zone of values of wu and finally the sum tends towards 
a Gaussian variable. 

We find that the sign of a — 2 distinguishes two sets of different 
random variables. The occurrence .of a > 2, or even a < 2 but close 
to 2, may mean a predominance of salaries (Lydall’s income model [13] 
is valid for all values of a). (Another explanation may be found in 
Mandelbrot [19].) On the other hand, several of the examples where 
a@ > 2 are encountered occur in communities of Oceania which are 
very much less self-contained than, for example, Great Britain or the 
U.S.A. Their distributions of U may perhaps be truncated. In any 
case, the sign of a — 2 raises an important empirical problem: Is the 
fit of the weak law equally good regardless of this sign, and does this 
law represent a higher percentage of the data where a < 2? 

A second consequence of the limitation 1 < a < 2 is that the second 
moment of U is then infinite, the first momenié being finite. This is 
easily seen, because by integration by parts, 
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E(U) = [7 ud F(u) ages |" ud P(u) “ it P(u)du < 
and D(U) = E(U’) — [E(U)P where 


E(U*) = | wdF(u) = ‘i wdP(u) = 2" uP(u)du =o, 
(We assume that the behavior of P(u) for negative u does not lead 
to a convergence problem for E(U).) 

The last result holds for both strong and weak Pareto variables if 
1<a<2, but it would, for example, cease to hold for the density 
p(u) = ku-*” exp(— bu) of 1.4. This provides a new and important 
test of our conjecture that b=0. The finite H(U) means that if uw, 
are samples from a Pareto distribution, the empirical mean EF, = 
> 2, u,/N tends to E(U) with probability 1 (Kolmogoroff’s strong law 
of large numbers); in addition, Ey is a good estimate for E(U), if N 
is large. Now consider Dy = }>%, (u,— Ey)'/N. If a>2, Dy tends to a 
finite limit D(U), of which it is a good estimate, and }>%, (u,— Ey)/VN 
tends to a Gaussian variable. This result changes little if one adds an 
exponential factor with small 6. However, if a< 2, and })=0 the 
limit of Dy, is infinity (one may show that Dy grows without limit 
like N**-") but if 6 becomes > 0, D(U) is finite. Therefore, the use- 
fulness of the exponential factor exp(— bu) may be tested by check- 
ing whether Dy, keeps increasing with N in the case of the largest 
of the samples available. We could not make the direct test, but an 
indirect test results from the following observation: the ordering of 
the different populations by ‘‘increasing inequality’’ should presumably 
be identical with their ordering by decreasing a; on the other hand, 
more usual measures of inequality are given by Dy, Dy/Ey or /D,/Ev. 
These two methods of ordering populations have been compared and 
found entirely contradictory. This result ceases to be absurd if one 
takes account of the fact that the values of N in the different sam- 
ples which were compared range from 10° to 10°; and it seems that 
even if D(U) were finite, it would not be approached, even with the 
largest samples. In that case, irrespective of any theory, it is pref- 
erable to take 6=0, and D(\U) =. Further, no function of Dy, 
is adequate to compare degrees of inequality, except perhaps between 
samples of identical size. 

Another test of the usefulness of the approximation D =o is pro- 
vided by the relative contribution to D, of the largest of the w,: this 
is very large (close to 1/2 for the Wisconsin incomes [23]), as might be 
predicted from the theory of the Pareto-Lévy law. The usual procedure 
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of truncating U in order to avoid D = distorts the whole problem. 

2.5. Heuristic study of the addition of two independent random 
variables, particularly in the Gaussian and weak Pareto cases. Let 
U’ and U” be two independert random variables, having the same 
probability density p(u). We shall compare the behavior of p(u) for 


large u, and that of p,(u) = r p(x)p(u — x)dx, which is the density 
of the sum U=U’@U”. It will be assumed that u may vary to 
+00, 

One basic case is 


pu’) = {© exP(— bu’) . U >0, 
0 if uw <0. 
Then, 


p{u) = \-c exp(— bx) exp[— b(u — 2)|dzx 
= [rc exp(— bu)dx = C*u exp(— bu). 


Note that in this case log[p(u’)] = logC —bw’ is represented by a 
straight line. The two next simplest cases are those where —log[p(u’)] 
has the same convexity, up or down, over the whole range of varia- 
tion of wu’, so that the expression —log p(x) —log p(u — x) has an ex- 
tremum for x = u/2. These two cases are illustrated in Figure 2. 

One kind of convexity arises when d’logp(u)/du? < 0, for all wu. 
Then p(u) decreases rapidly as u-—+ o, and the integrand p(x)p(u — x) 
has a maximum for x = u/2. If that maximum is strong enough, the 
integral \" p(x)p(u — x)dz is likely to be made up to a great extent 
of the contributions of the x’s in a small interval near u/2. Hence, 
a large value of u is likely to come from two contributions wu’ and 
u” which are almost equal. (This is not obvious, and in particular it 
is not true if the concavity of —logp is turned the other way.) 

Consider for example the Gaussian density 


1 
OV 2x 


x? 


p(x) oa rw, . 





exp(— 


It is known that 


pu) = —)_ exp (— =) ; 
OV2 V2z 4a° 


so that the Gaussian character of density is preserved in addition, 
except for the value of g¢. The same result can also be obtained heu- 
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ristically, by arguing that p(x)p(u — x) stays near its maximum value 
{p(u/2)}* over some interval D/2 on each side of u/2, and is negli- 
gible elsewhere. In that case, one gets the approximate estimate: 


D exp(— +) 2 

o2r 40° 

In other words, the approximate estimate is correct, if one takes D = 
01’x independent of wu. 


Suppose now that p(u) decreases slowly, so that d* log p{u)/du* > 0, 
for all u. Then p(x)p(u — x) has a minimum for x= u/2. It may 





p{u) ~ p(u/2)p(u/2)D = 


/ 
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FIGURE 2: DISTRIBUTION OF U’ WHEN U’Q@U" IS KNOWN 


be noted that p(x) must — 0 as x o and must either =0 or —0 
as x-—+—co. This is compatible with a —log p(x) concave downwards, 
but only if the range of variation of x is truncated from below. As- 
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sume that wu’ can only be > wu’, so that its most probable value is w’. 
In that case, p(x)p(u — x) will have two maxima, for x = u° and for 
z=u-—u’. If they are strong enough, p,(u) will be composed mostly 
of the contributions of the neighborhoods of these maxima. Compare 
then the following two expressions: 


pu) =2\"" payplu —a2)dz, and 2)” »(x)p(u)da 


These two expressions differ only in the small contributions of x’s very 
different from x = 0; that is, 


u/2 é 
pfu) ~ 2p(u) \ p(x)der . 
Finally, if « is large, i p2)de~ ¥ p(x)dze, so that 
0 0 


p{u) ~ 2p(u) . 
Hence, 


|" péayde = Pu) ~ 2P(u) 


Note that a large value of u is now likely to be the sum of a relatively 
very small value of either u’ (or u”) and of a value of u” (or u’) 
very close to u—u’. The two addends are likely to be of very un- 
equal size; but the problem is entirely symmetric, so that the mean 
value E(u'/u) is still u/2, by compensation. (See also Appendix II.) 

We have in addition proved that the distribution of the largest of 
two variables U’ and U” of the slowly decreasing type has the same 
asymptotic behavior as the distribution of their sum. Another deriva- 
tion of this result starts from the derivation of the distributivun of 
max(U’, U”). Clearly, P™(u), the probability that uw be larger than 
max(U’, U") is the probability that u be larger than both w’ and w”. 
Hence, 


1— P™%u) = [1 — P(u)P. 
For large u and small P(u), this becomes 
P™(u) ~ 2P(u). 
That is, for slowly decreasing densities, P"(u) ~ P,{u). 


A prototype of the slowly decreasing variable is the strong Pareto 
variable. In ‘aat case, we can write: 


Pu) ~ 2P(u) = P(2-"*u) . 
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That is, the sum of two independent and idential strong Pareto 
variables is a weak Pareto variable, with the same a and a scale 
factor u° multiplied by 2"*. 

Likewise, any weak Pareto distribution will be invariant in addition 
up to the value of wu’. The proof requires an easy refinement of the 
previous argument, to cover the case where —log p(x) is not concave 
or convex all through the range of x, but d’log p(x)/dxz? becomes and 
stays negative for large values of x (which implies a quite regular 
behavior for —log p(x) in that region). One can show in this way that 
the weak Pareto law is preserved in the addition of two (or of a few) 
independent random variables: there is no self-contradiction in the 
observed fact that this law holds for parts of income as well as for 
the whole. That is, the exact definition of the term ‘‘income’’ may 
not be a matter of great concern. But, conversely, it is unlikely that 
the observed data on P(u) for large u will be useful in discriminat- 
ing among several different definitions of ‘‘income.’’ 

The weak Pareto and the Gaussian are the only laws strictly having 
the above invariance (‘‘stability’’) property. They will be distinguished 
by a criterion of ‘‘equality’’ versus ‘‘inequality’’ between wu’ and w”, 
when u=wu’+ wu” is known and large. (2.6.) In 2.8 we shall cite a 
further known result concerning stable probability distributions. 

We may also need to know the behavior of p,(u), when the density 
p’(u’) of U’ decreases slowly and p’’(u”’) decreases rapidly. in that case, 
a large wu is likely to be equal to uw’, plus some ‘‘small fluctuation.’’ 
In particular, a Gaussian error of observation concerning a weak Pareto 
variable is quite negligible for large w. 

2.6. The problem of addition and of division into two, in the case 
of stable variables. We have shown that the behavior of the sum of 
two variables is determined mainly by the convexity of —log p(u): we 
shall later show that this criterion is in general insufficient to study 
the addition of many variables. However, if we limit ourselves to 
stable random variables, the convexity of —log p(u’) is sufficient to 
distinguish between the case of the Gaussian and that of all other stable 
distributions. That is, these two cases may be distinguished by the 
criterion of approximate equality of the parts of a Gaussian sum con- 
trasted with the great inequality between these parts in the case of 
all other distributions, in particular the Pareto-Lévy one. 

This distribution has been used so far only to derive the distribution 
of U’@U”. Suppose now that the value u of U is given and that 
we wish to study the distribution of uw’ or of u”=u-—vu’. If the 
a priori distribution of U’ is Gaussian, with mean M and variance o’, 
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then the a priori distribution of U is Gaussian with mean 2M and 
variance 20, and the conditional distribution of wu’, given wu, is: 





(2n)-"¥¢- exp| lent Jano" exp| — tet Sout I | 
p(u!|u) = 6 oe 
2-"(277)-V4g-! exp| — (u hae ) ] 
4a" 


= 2"(27)-40- exp| — A = ter | t 


This is Gaussian, with mean value u/2 and variance o*/2. A striking 
feature of the result is that the law of u’ depends on u only through 
the mean value of U’ (see the right-hand side of Figure 2). 

Let us now consider the non-Gaussian case. Insofar as our theory 
is adequate in income studies, the problem of division will appear in 
such questions as the following: if we know the sum of the agricul- 
tural and industrial incomes of an individual and if the a priori dis- 
tributions of both these quantities are Pareto-Lévy with the same a, 
then what is the distribution of the agricultural income? This case 
is more involved than the Gaussian one, because we do not know any 
explicit analytic form for the distribution of wu’ or uw”; we can, how- 
ever, do some numerical plotting (see the left-hand side of Figure 2). 

If the sum u is very large, we find that the distribution of w’ has 
two very sharp maxima, near w’,,,, and u—w’,,,. As u decreases, 
the shape of this distribution of wu’ will change, instead of being simply 
translated, as in the Gaussian case. When u becomes small, more 
maxima may appear. They will then all merge, and the distribution 
of u’ will differ little from that which is valid in the Gaussian case. 
Finally, as « becomes negative and very large, the distribution of w’ 
will remain one with a single maximum. 

Hence, bisection provides a very sharp distinction in this respect 
between the Gaussian and all other stable laws. 

Now, consider a fairly small number N; what is the distribution of 
(1/N)-th of a stable variable? In the Gaussian case this (1/N)-th re- 
mains Gaussian, whatever N may be; its mean value is u/N, and its 
variance (N — 1)o°*/N. In the non-Gaussian case, each part of a large 
u may be small or it may be close to u; most of the N parts will be 
small, but the largest of them will be likely to be close to the whole. 

“he situation is less intuitive when N becomes very large. How- 
ever, Lévy has proved that the necessary and sufficient condition for 
the limit of the @-sum UU, to be Gaussian is that the value u, 
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of the largest of the summands be negligible compared to the whole. 
On the contrary, if the limit is stable non-Gaussian and such that 
E(U) = 0, both the sum and the largest of the summands will increase 
roughly as N’”*, and one can even show (Darling [4]) that their ratio 
tends towards a limit, which is a random number having a distribu- 
tion dependent upon a. 

Let us now return to the discussion of 2.2. We argued there that 
U is the sum of N components, without knowing N. A posteriori 
this turns out to be quite acceptable, because the largest (or the few 
largest) of the components will cover a substantial part of the whole, 
essentially independent of the number N of components. This emi- 
nently desirable feature of the P-L theory is an important confirma- 
tion of its usefulness. 

If, however, a P-L income is small, its components are likely to be 
of the same order of magnitude as in the Gaussian case. This has 
an important bearing upon the problem touched near the end of 2.1. 
Assume that one has a Census category in which income is usually 
rather small and which is such that when U is decomposed into parts, 
the sizes of the parts tend to be proportional to their a priori sizes. 
One may assimilate this behavior to that of a Gaussian unskilled work- 
er’s income (the decomposition may now refer to such things as the 
lengths of time during which parts of income were earned). But such 
behavior may also be that of a P-L variable, considered for small 
values of wu. Asa result, the seemingly fundamental probiem of split- 
ting income into two parts so that only one follows the P-L law is 
bound to have some solutions which are unassailable, but impossible 
to justify positively. Hence, it is questionable whether this problem 
is really fundamental. 

2.7. Addition of many weak Pareto variables; possible asymptotic 
invariance of the weak Pareto density if 0 < a < 2; non-invariance 
in the case a>2. Applying the reasoning of 2.5 to the @-sum Wy = 
>2,U, of N weak Pareto variables, we obtain the following value 
for the function P, of the variable Wy, 


P,(u) ~ NP(u) ~ P(uN-"*). 


This relationship may be expressed alternatively as follows: let a be 
any small probability, let u(a) be the value of U such that P[u(a)] = a, 
and let wy(a) be the value of Wy such that P,[w,y(a)] =a. The above 
approximation then becomes 

w,y(a)~ N*u(a) . 


Either way, the distribution of W,N~‘* would be independent of 
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N for large values of w; in other terms, W, would ‘‘diffuse’’ like N”*. 
Actually, there are two obvious limitations to the validity of the ap- 
proximation Py, ~ NP. 

The first limitation is that when a> 1 and hence E(U) < © this 
approximation can, at best, be valid if E(U)=0. This gives the only 
possible choice of origin of U, such that when N— o, the distribution 
of W,N-"* can tend to a limit, over a range of wy such that Py(wy) 
does not decrease to zero. 

The second limitation is that when a > 2, the relationship Py ~ NP 
can at best hold in a zone of values wy such that the total probability 
P,(Wy)— 0 as N-— . This is due to the fact that U now has a finite 
variance D(U), and it is possible to apply the classical central limit 
theorem to U. That is, we can now assert that, when No, 


W, — NE(U) 
hoy <" 





1 z 
tends to <Fex |" exp(—v'/2)dy 5 


This holds over a range of values of x, of which the probability P,—1 
as N— oo. That is, an increasingly large range of values of Wy will 
eventually enter into the Gaussian zone, in which W, — NE(U) diffuses 
like N’? (that is, more rapidly than N’’*). As a counterpart, the total 
probability of the values of x, such that Py~NP, must tend to zero as 
N— o, 

Note that the N”” diffusion is not radically changed if U is trun- 
cated to less than some fixed bound; that is, the N’? diffusion repre- 
sents the behavior of those values of W, which are sums of comparable 
small contributions, whereas the N’* diffusion represents the behavior 
of sums of a single large term and of many very small ones. 

We can now draw some conclusions concerning the relationship be- 
tween the behavior of the largest of N terms U, and the bel avior of 
their sum. In the weak Pareto case, the two problems are identical 
if N = 2, whatever a; but as N increases they become distinct prob- 
lems. The problem of the maximum is the only one which remains 
simple and continues to lead to a weak Pareto variable. On the con- 
trary, the concavity of —log p(w’) is not a sufficiently stringent criterion 

® To show this, write V= U+c; then Vy = Uy + Ne and VwN-* = UyN-Ve + 
cN'-Ve. When N-> o, the last term will increase without limit, so that VwN~-/@ can- 
not have a non-trivial limit distribution, if UyN~-/* has one. Further, if UyN~-/@ has 
a limit distribution. Uy/N wili have the degenerate limit 0, so that we must assume 
that E(U) = 0. 

If, on the contrary, a <1, the above argument fails, because N'!-'/@ tends to zero; 


therefore WyN~-¥@ could have a limit distribution on a non-decreasing range of values 
of U, whatever the origin of U (anyway E(U) = ). 
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to discriminate between those cases where P, ~ NP does or does not 
apply, over a range of values of u having a fixed probability. 

2.8. Infinite divisibility of the stable distributions. If U is stable, 
we can write, designating by }- the sums in the sense of @, 


U= A(N) S00, — BIN) = D {A(N)U, — BIN)/N} = DVy,, 


where the U, are independent values of U and where Vy, = A(N)U, — 
B(N)/N. Hence, for every N, U can be considered as a sum of N 
independent and identically distributed variables, V,, This is the 
definition of infinite divisibility for a random variable. Suppose that 
U is a P-L variable with E(U)=0. Then the infinite divisibility of 
U is made obvious by writing the c.f. y(f) in the following fashion: 


log g(t) = Cle ~1— itz) | d(x-*)|. 


To divide a P-L variable by N we need only replace C by C/N. This 
preserves the form of the function log y(f), as it should, because 
(1/N)-th of a stable variable is itself stable. If ¢ is chosen small 
enough, we may approximate log y(g) by the integral log @(f, «), re- 
stricted to the range («<< 4 << ). Further, the term igC\ 2! d(x-*)| 
of log p(f, €) amounts to a non-random translation of U. The essential 
part of log p(f), is, therefore, the part C\ (e%* — 1)|d(x-*)|, which is 
a limit of approximations of the form > (e's — 1)| A(x-*)|. 

We may thus represent U as a limit of sums of Poisson variables. 
To each increment dx of the variable x there is a corresponding con- 
tribution to U equal to x multiplied by a Poisson variable of expected 
value C\d(x~*)|. This means that a Pareto-Lévy variable may be 
considered as a sum of variubles, each of which is closely related 
to the strong Pareto law. (The strong Pareto law may be truncated 
at any ¢>0, because the term ifx of log g(t) takes care of the 
divergence near 0 of the integral | (eS? -- 1)| d(x~*) |.) 

0 

This relationship between the strong Pareto and the P-L laws and 
the role of the convergence factor ifx|d(x-*)| are made quite clear 
in a very intuitive physical problem of Hoitsmark [9]. 

The problem is that of the attraction exerted on a star by an in- 
finite uniform cloud of identica! stars. Let us postpone convergence 
problems and consider first a very large sphere of radius R, with'n 
which WN stars of unit mass are distributed at random, uniformiy 
and independently. A final star is located at the center 0 of the sphere, 
and we wish to compute the resultant of the Newtonian attractions 














102 BENOIT MANDELBROT 


exerted on 0 by the N other stars. Units will be chosen in such a way 
that two stars of unit mass attract each other with the force r~? = wu. 
Let D = N(4R‘'z/3)-' be the average density of stars, and let u* = R~’. 
Consider first the attraction of the stars located within a thin pencil 
(or infinitesimal cone) covering dS spherical radians, having its apex 
at 0, and extending in one direction only from 0. This pencil is a sum 
of cells, each of which is contained between some radius r and some 
radius r+dr. The volume of such a cell is dV = dSd(r*) = dS |d(u-"")|. 
Therefore, if there were a single star in the pencil dS, the probability 
of its being in the volume dV would be given by the strong Pareto 
distribution: 
dS -d(u-*") 
dS-R' 


The characteristic function of this distribution is a fairly involved func- 
tion p(fu*). If there were N stars in the pencil, the probability that 
the attraction on 0 is u would have the c.f. 9”(fu*), which becomes 
increasingly more involved as N— o. 

The problem is simplified if (D remaining constant) R-—o, and 
N— oo. It then becomes possible to assume that the number of stars 
in the cell of volume dV is not fixed but is given by a Poisson ran- 
dom variable, with the same D as the expected density. It is clear 
that one can easily go from one case to the other by slightly chang- 
ing the distribution of stars for large R, an area of small values of w. 

In this Poisson approximation, the stars located in the volume dV 
will exert a total force which is a multiple of u = r~’, the multiplier 
being a Poisson variable with expected value D-dS-|d(u-**)|. That 
is, the total force exerted on 0 will be the sum of a number of in- 
dependent discrete jumps. (The mean relative number of jumps, with a 
value between some a and some }, will be D-dS-(a-*? — b-*")/D-dS-u*-"?, 
i.e., it will follow the strong Pareto law.) Thec.f. of the total con- 
tribution of the pencil dS will then be approximated by the integral 


log 9 4(6) = (D-dS)\"_,(e — 1)|d(u-" |. 





= |d(ufur)*” 





The fact that the integral is extended to u = © raises no convergence 
difficulty, but the careless extension of the integration down to u = 0 
(R = co) would lead to a divergent sum. The physical reason for 
this is that, although each of the distant stars contributes little at- 
traction, their number is such that their total mean attraction is in- 
finite. However, one may disregard this infinite mean value, because 
the difference between the attraction and its mean is finite; indeed, 
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the fluctuation of the contribution of far away stars has the charac- 
teristic function 


-2 
#(¢) = exp{(D-dS)\" (es — 1 igu)} |d(u-™|, 
which converges and tends to zero as R-—+ oo, For the sake of con- 
venience, the same correction ifu may also be used for all other values 
of u, since its effect for large wu is only to add a finite translation to 
U. Hence, the difference between the attraction of the stars in the 
pencil dS, and the mean value of this attraction, is a ‘‘positive’’ 
stable variable, or ‘‘Pareto-Lévy’’ variable, with a = 3/2. 

The meaning of the stability of this attraction is that if two clouds 
of stars are distinguished by their colors (say red and blue) but have 
the same density and fill the same pencil dS, then the forces exerted 
on 0 by red or blue stars alone, or by both together, differ only by 
a scale factor and not in the analytic form of their distributions. 

It is also evident why large negative values of wu are unlikely com- 
pared to large positive values. A large negative uw can occur only if 
there is an abnormally sma'l number of stars. Moreover, the absence 
of any stars near 0 is a quite likely event, but alone it can at best 
give a bounded negative wu. Therefore, a large negative u must also 
contain the negative contributions of stars missing far away from 0; 
each of these stars contributes little to U, so that the number of miss- 
ing stars must be very large, and this is very unlikely. 

On the contrary, an unboundedly large positive u may be obtained 
from the presence of a single star near 0, irrespective of the density 
of stars elsewhere; such an event is far more likely than the combi- 
nation of events required for a negative uw. It is easy to check the 
fact that the distribution of U has the same asymptotic behavior for 
large u as the distribution of the attraction of the nearest star. 


8. CONCLUSION 


It is to be hoped that this first application of the stable laws will 
stimulate interest in a more detailed study of their properties. Most 
of the usual procedures of statistics must be revised because of the 
infinite variance of U; such new problems arise, as that of the best 
choice of the origin uw”, in such fashion that the P-L density p(w) and 
the strong Pareto density [(u — w”)/u’]-* coincide over as large a range 
of values of wu as possible. 

Another series of problems concerns the comparison of the P-L curve 
with the empirical data in tie region of intermediate values of w. 
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A final problem concerns the sign of a— 2. We have referred to 
it several times, but a further discussion can be pursued only within 
the framework of a theory of P-L processes. (One indication may 
already be found in Section 2 of [19|.) To settle this problem, it will 
probably be necessary to introduce some dependence between the addi- 
tive components of U; this must be done carefully, however, if one 
wants to avoid obtaining a wholly indeterminate answer. See also 
footnote 3. 


International Business Machines Corporation, U.S.A. 


APPENDIX I 


The behavior of the P-L density for large negative arguments is not classical. To derive 
it, we note first that the form of the bilateral generating function G(b) (which is not com- 
monly used) may be deduced from the commonly used characteristic function, with the 
help of some standard theorems on Fourier transforms in the complex plane. From the 
existence of G(b) it follows that p(u) must decrease faster than any form exp(—|bu)), 
when u—> —o. 

To show exactly how fast p(u) decreases, write for convenience v= —u and jf{v)= 
—log p(v); flv) will increase faster than any linear form of v. Write further: 


Gib) = \ exp(bv)p(v)dv = \ 


exp [bv — flv)]dv= | exp [A(v)]dv . 
If 6 is large, the integrand exp(h) must have a maximum for the value of v such that 
b= f'(w). Near that point, we can write: 


h(v) = [bw — f"(w)) — (1/2Xv — w)Pf""(w) + (1/6Xv — wPfOX(w) + ---. 


If we could limit oneself to terms of order 0 and 2, we would find for G(5) the approxi- 
mation exp[bw — f’(w)][2x/f’"(w)]-"2. Let us investigate the possibility of equating this 
approximation to exp(b*), with a f(v) of the form Kv*; if we take account of the term 
exp[bw — f’(w)] alone, the approximation may indeed be made equal to exp(b*) by tak- 
ing ¢=a(a —1)-1. The term [2x/f’’(w)]-'/? weakens the result somewhat, since, instead 
of having log[—log p(v)] = log K + clogv, we can only assert that log(—logp)/logv—c, 
as v- co. Finally, consider the terms other than those of orders 0 and 2: we note that 
the term in (v — w) gives a non-negligible contribution to G(b), only as long as (v — w) 
is of the order of magnitude of [f!"(w)]-/2 ~ w!-e(%(e~-1)]~', Then the term in (v — w) is 
of the order of magnitude of w-2(%e-1)]-' and is negligible; similarly the terms of 
higher order do not modify the behavior of p(v). 


APPENDIX II 


We can exhibit the invariance of the weak Pareto variables in a specific example. Let 
0<a<.1, and consider the discrete variable having the following Z(b) as discrete (one- 
sided) generating function: 


Z(b) = E exp(—bn)p(m) = 1-Cil-e-* (where 0<C<1). 
n= 


All p(n) are positive and less than 1, and Sn-: p(n)=1. For large n, 
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p(n) ~ oe : 
The sum of two variables of this type has the generating function: 

Zx{b) = Z%(b) — 1 — 2C(1 — e-*)* + CX1 — e- 20, 
so that 


2Cn-(e*) C%- Ger) 
Pie) ~ Ta) T'(=2a) 





= 2p(n) + correction . 


For large n, the second part of the right-hand term becomes negligible compared to the 
first. If a= 1/2, it is zero, so that the range of values of n in which it may be neglected 
increases as a tends to 1/2. 

If 1<a< 2, we must add a factor in (1 — e~°) to obtain an acceptable generating func- 
tion; consider for example: 


Z(b) = 1 — C(l — e~*) + C"(1 — e-*) 


where 0 < aC’ < C< C’ +1, so that C’ < (a — 1)-!. 
Now 

2Cn-(*D = 2CC "(a + 1)n-@* Cn - a+) 
T(—a) T(—a) + “T(=2a) 





pin) ~ 


For large n, the second and third terms become negligible for all a. The ratio of the 
coefficients of the first and of the second term depends little upon a; but the ratio of 
the coefficients of the third and first terms is ruled by /'(—a)/I'(—2a), which is zero for 
a = 3/2, but may become large elsewhere. As a result, the range of values of n over 
which the third term is important may be large. 

Each time a increases past an integral value, the sign of C(l — e->) must be changed, 
and another polynomial term must be added to Z(b), to keep it a generating function. 
The number of corrective terms of s(n) — 2p(m) increases, as well as the range of values 
of m in which the corrective terms are appreciable. 

Similarly, as more than two terms are added, py(n) — Np(n) is vitiated over an in- 
creasing range of values of n. Let N-» «, and observe the weighted sums of the varia- 
bles U;, whose values are the integers n. (>> remains a summation in the sense of @.) 

For 0 < a <1, it is sufficient to consider the expression Wy = N-* ¥'U;, since its 
g.f. is Z%(N-1/2b), which tends to exp(—Cb*) when N -> ~, as it should. 

If 1 <a < 2, one must consider the expression Wy = N-@ >\(U; — M), where M, the 
mean value of U;, is easily found to be C. The g.f. of Wy is clearly exp(NCb)Z*(N-1/2b) 
and when N — o, it tends to exp(Cb), as it should. It is easily seen that if we choose 
for M a value different from C, the g.f. of Ww will not tend to a non-degenerate ex- 
pression. 

If the value of a is higher, no linear renorming of U; can eliminate from log[Z(b)] 
the square term Kb?, and hence, the best normed sum of the U; is the classically normed 
N-2>(U;, — M), which tends to a Gaussian, whatever the value of a. 


APPENDIX III 


This appendix concerns the transformation V=log U, which is used in most classical 
theories of income distribution. 

The strong objections against this transformation do not apply at all in the case of a law 
due to Estoup and Zipf, which is formally similar to that of Pareto, but relates to word 
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frequencies, and which we have studied since 1951 (cf. [14], [16]}). In the formal expression 
of that law U is replaced by the “‘rank’’ of a word, where words are crdered by decreasing 
frequencies. Hence log U has an intrinsic meaning as ‘‘cost of coding’; in particular, 
the addition of ‘‘costs’’ is quite meaningful. 

In the case of income, the transformation V=log U is justified in our [18] and [19], 
and in a more detailed article which ought to appear soon. 


[1] 


[2] 


[24) 
(25) 
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ON EQUILIBRIUM GROWTH OF CAPITAL AND LABOR* 
By YOIcHI SHINKAr 


IT Is one of the characteristic conclusions of the Harrod-Domar model 
that it is impossible to sustain full employment unless the warranted 
rate of growth coincides with the natural rate, and the initial stock 
of capital just equals the initial supply of labor multiplied by ‘‘the’’ 
capital-labor ratio. This ‘‘knife-edge notion of unstable balance’’ fol- 
lows from the crucial assumption that production takes place under 
conditions of fixed proportions of capital and labor. Thus Frofessor 
Solow [5] has shown that when production takes place under the neo- 
classical conditions of variable proportions and constant returns to scale, 
equilibrium growth is very likely to be stable. 

His conclusion is based on a one-sector analysis. In this paper we 
present a two-sector model, where all industries are divided into two 
branches, capital-good and consumption-good industries. On the as- 
sumption that only one technique of production is available to each of 
the two sectors, we find that if the capital-good industry is more 
capital-intensive than the consumption-good industry, the balanced 
growth of capital and labor is necessarily unstable; if the former is 
more labor-intensive than the latter, the balanced growth will be stable. 
Thus balanced (or equilibrium) growth in our model is of the Harrod- 
Domar type or of the Solow type, depending on the relative magni- 
tudes of the capital-labor ratios of the two sectors. 

A further historical note may be added. As the reader will find, our 
model has much in common with Mrs. Robinson’s ‘‘simple model’’ of 
capital accumulation [4, pp. 63-100] and Professor Morishima’s inter- 
pretation of her model [3], and our equilibrium growth corresponds to 
her ‘‘golden age’’ (under the hypothesis of no technical progress). If 
our stability condition is satisfied, full utilization of capital and full 
employment of labor can proceed harmoniously. If the ‘‘golden-age’’ 
path is unstable, on the other hand, the maintenance of full employ- 
ment will, in most cases, lead to a breakdown of capital accumulation. 


1. ASSUMPTIONS AND MODEL 


We start with the following assumptions: 


* Manuscript received October 4, 1959. 
1 Thanks are due to Professors L. R. Klein, M. Morishima, and the referee of this paper 
for many helpful comments. 
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(1) There are only two kinds of goods, capital goods and consump- 
tion goods. Capital goods are not demanded by consumers, and con- 
sumption goods are not used for production. 

(2) Each good requires for its production capital goods and labor 
as inputs; we abstract from the existence of land and scarce natural 
resources. 

(3) Only one technique is available to each industry, i.e., all tech- 
nological coefficients are constant. 

(4) The period of production in each sector is ignored. 

(5) Capital stock is freely transferable from one sector to the other. 

(6) The problem of depreciation is ignored, so that the output of 
the capital-good industry is defined net of the wear and tear of the 
capital stock of the whole economy. (This means that the output of 
the capital-good industry iz equal to the rate of net investment of the 
economy.) 

(7) Labor is homogeneous; the wage rate is always the same for all 
workers. 

(8) The supply of labor grows at a given constant rate. 

(9) Workers only consume; capitalists only save; all savings are re- 
invested by capitalists. 

The notation used is as follows: 

Sector 1 ... capital-good industry (or investment sector), 


Sector 2 ... consumption-good industry (or consumption sector), 
, mers output of sector i, 

| rrr labor employed in sector i, 

G. «satetees capital-labor ratio of sector i, 

Da vide aaetak labor-input coefficient of sector i, 

W waenke tun real wage rate, 

ey total supply of labor, 

E, .«ateensue stock of capital in existence, 

SB wcwemneee growth rate of N. 


The conditions for equilibrium (no excess capacity, no excess demand 
for consumption goods and full employment of labor) may be written 
as follows: 


(1) K=4a,N,+4,N,, 
(2) X,= W(N, + N,), 
(3) N=N,+N,. 


Given the stock of capital in existence K and the available supply 
of labor N, the equations (1), (2) and (3) will determine three varia- 
bles N,, N, and W, because we can reduce X, to N, by means of 
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Assumption 3. Once N, and N, are determined, the outputs of both 
sectors, X, and X, are also determined. This is what happens in a single 
period. 

In the next period the stock of capital in existence is incrzased by 
X,, while the labor force is larger by nN than before. With these 
new values of K and N inseried, (1), (2) and (8) will give new values 
of N,, N, and W, of which the first, in turn, will determine the rate 
of investment, X,, in the second period. Thus the accumulation of 
capital continues, always satisfying the conditions for equilibrium and 
full employment. 

This dynamic aspect of our model may be analysed by taking 


into account explicitly. Remembering that the growth rate of N is n, 
we easily obtain from (1), (3) and (4): 


a,N, + a,N, = mN,, 


(5) N,+N,=N,+N,, 


where m stands for the output per man in the investment sector (i. e., 


the inverse of },). In order for these equations to have solutions, the 


a, a, 


determinant If 1 must not vanish, that is: 





a=a,—a, + 0.’ 


This same condition must also hold for the existence of solutions to 
the static system (1) — (3) above. 
Now let A, and A, be the roots of the characteristic equation: 
m—@rAX — ar 


=0. 
(6) m—xX n—x 


Then the general solution of the equations (5) may be written in the 
form: 
(7) N, = Ae*s' + Bes , 

N, = Ce + De, 
where A, B, C and D depend on the initial conditions and the coef- 
ficients of the equations. By writing the initial stock of capital as K, 
and the initial supply of labor as N,, we have: 


% It 1s apparent that if a= 0, i. e., if the capital-labor ratios of both sectors are equal, 
our model loses most of the characteristics of a two-sector model. 
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A=cN,, 

B= (K,/c — N,/bn)d , 
(8) C=(1—e)N,, 

D=(N,/bn — K,/c)d = —B, 
where: 


c = a,bn/(1 — abn) , 
d=c/a. 
Solving (6), we find: 
4 = and A, =1/ad,. 


2. STABILITY OF EQUILIBRIUM GROWTH 


In normal circumstances the capital coefficient, a,b,, of the invest- 
ment sector will be at most 10, and the growth rate of labor, n, can 
hardly be greater than 0.02, so that we may safely assume:* 

1—abnr>od0. 
Hence 0<c¢<1 in (8), which means that both A and C in (7) are 
strictly positive. 

First, we shall be concerned with the case where the initial stock 


of capital, K,, is equal to cN,/bn. It follows from (8) that B and D 
vanish in (7). Thus we obtain: 


(9) N, = Ae“ and N,=Ce™. 


By virtue of the assumption of rigid production coefficients, X,, X,, 
K, and K, grow at the common rate n. In this case the economy is 
in a state of equilibrium (or balanced) growth. 

Second, let K, be smaller than cN,/b.n, a situation which may be 
referred to as the case of capital shortage.‘ If, in this case, the con- 
sumption sector is more capital-intensive than the investment sector, 
i.e., if a (= a, —a,) is negative, then d is negative, so that B>0O 
and D<0 in (7). Clearly N, is larger than in equilibrium growth, 
and N,is smaller. However, since \, is negative in this case, N, and 
N, eventually approach the equilibrium growth path. In the earlier 
stages capital is accumulating faster than the available labor increases, 
and capital stock will catch up with labor in the end.° 


3 It should be noted that although each of the values, a,b; and n, taken separately 
depends on the length of period we choose, the product of the two is independent of it. 

* We assume that Ky — cNo/bin is not so small as to make N, or Nz in (7) negative. 

6 It is interesting to note that some economists seem to regard Mrs. Robinson’s model 
as stable. Thus Professor Champernowne [1] and Dr. Little [2] present models of which 
the capital-labor ratio of the investment sector is zero. 





EQUILIBRIUM GROWTH OF CAPITAL AND LABOR 111 


If the investment sector is more capital-intensive than the consump- 
tion sector, on the other hand, i.e., if a is positive, we find that B < 0 
and D> 0, and that N, is smaller and N, is larger than the equilib- 
rium values (9). The output of capital goods, therefore, is smaller 
than in equilibrium growth; this will aggravate the shortage of capital 
in the next period. In fact, if a >0, thenr, > 0, so that Be gets 
smaller (algebraically) and De*** gets larger, when ¢ increases. Fur- 
thermore, since a,b,n is assumed to be less than unity, it can easily 
be shown that if a>0, A, >2r,. Hence the term Be** will eventually 
become dominant in the solution (7), so that N, will ultimately be- 
come zero. When N, is zero, the output of capital goods ceases, and 
capital accumulation will be at a standstill. 

Finally, much the same argument can be applied to the case of capital 
surplus (K, >cN,/b,n). In this case, too, equilibrium growth is stable 
if a is negative and unstable if it is positive. The main difference 
from the case of capital shortage lies in the way in which capital 
accumulation comes to a standstill when equilibrium growth is unstable. 
If K, >cN,/b.n anda>0, then B>0, D<0, and’, >A,>0in(7). As 
time goes by, N, becomes ever larger and N, smaller than the equilib- 
rium values (9). This state of affairs must come to an end when 
N, is zero, because in that event the output of consumers’ goods will 
be nil. 

Thus if, equilibrium growth is unstable, the slightest disturbance 
of initial conditions away from the path of equilibrium growth will 
sooner or later result in a state where either N, or N, is zero. After 
this point of time, it is impossible to maintain full utilization of 
capital and full employment of labor simultaneously. Unemployment 
of labor or excess capacity is inevitable according as the initial stock 
of capital K, is smaller or greater than cN,/b,n. 


Osaka University, Japan 
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A THEORY OF INTERSTATE DIFFERENCES IN THE RATES 
OF GROWTH OF MANUFACTURING EMPLOYMENT 
IN A FREE MARKET AREA* 


By JEROME L. STEIN’ 


IN THIS ARTICLE we shall develop a theory to explain the existence 
of interstate differences in rates of growth of manufacturing employ- 
ment among states linked together in a free market area. This free 
market area is characterized by free trade, free capital movements, 
and the use of a common currency. The contemporary theories of 
economic growth have concerned themselves with a closed economy and 
have not explained the existence of interstate differences in growth 
rates in a free market area. 

The United States of America constitute a free market area and, 
as such, provide an excellent subject for analysis. Over the period 
covered, certain empirical regularities have been discovered concerning 
the process of growth. These regularities are described in Part 1 
and are the phenomena to be explained by a theory of growth. The 
need for a theory of growth is developed in Part 2, where the inade- 
quacy of simplistic rationalizations is demonstrated. Part 3 presents 
a theory of growth which implies the phenomena cited in Part 1. The 
explanatory power of this theory is explicitly stressed in Part 4. 

There exist an unlimited number of theories which are compatible 
with a given set of data. The virtues of the theory presented here 
are: (a) it is testable against the available data; (b) it implies the 
data cited in Part 1; (c) the theory is conceptually simple and is 
based upon a competitive model; and (d) the crucial variable is the 
rate of growth of the labor supply curve rather than a nebulous vari- 
able like natural resources endowment. 


1. THE EMPIRICAL PHENOMENA TO BE EXPLAINED 


A. The Stability of Relative Growth Rates Within the United States. 
The first phenomenon to be explained is that interstate differences 
in rates of growth of production worker employment display certain 

* Manuscript received August 10, 1959. 

1 This work was supported by a Ford Foundation grant for the investigation of regional 
economic maturity. The author is indebted to George Borts, Michael Brennan, Phillip 
Cagan, Richard Muth, and Merton Stoltz for helpful advice and criticism, but is solely 
responsible for the views advanced here. 
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patterns of regularity. Certain states have consistently grown more 
rapidly than the national average; other states have consistently grown 
at rates below the national average. 

We have data on the average number of wage earners in manu- 
facturing establishments for forty-five states over the eight decades 
from 1869 to 1949. Define as a ‘‘success’’ the event that a yiven state 
exhibits a greater rate of growth of wage earners’ employment than 
the national average over a given decade. From 1869 to 1949 a state 
can have no fewer than zero and no more than eight successes. The 
distribution of states by number of successes is given in columns 1 
and 2 of Table 1 below. There are three states for which there are 
no data on the early periods. This leaves forty-five states with eight 
observations each. 

We must show that the distribution of states by number of suc- 
cesses is not a phenomenon produced by chance. There were a total 
of 184 successes out of 360 (= 45 x 8) observations. Assume that the 
event of a success or a failure is a Bernoulli trial with a fixed proba- 
bility of one-half. The distribution of states by successes would then 
be the same as the distribution of forty-five families, with eight 
children each, by the number of boys per family. The probability of 


k successes in eight trials would be d(k; 8, 4) or (ja. The expected 


number of states would be equal to 45 times b(k; 8,4). This is given 
in column 8 of Table 1. The data were grouped to insure at least 
five observations per cell. 














TABLE 1 
THE DISTRIBUTION OF STATES BY NUMBER OF SUCCESSES, 1869-1949 
Number of Successes Number of States | Expected Number of States 
0 3) 
: - 10 6.5 
2 4 
3 6 10 
4 8 12 
5 8 10 
6 9\ 
i ‘| 13 6.5 | 
8 1 
45 45 








Source: R. A. Easterlin, ‘Estimates of Manufacturing Activity,”’ in E. S. Lee, et. al., 
Population Redistribution and Economic Growth, United States, 1870-1950 (Philadelphia: 
The American Philosophical Society, 1957). 








114 JEROME L. STEIN 


A chi-square test produces a value of chi-square of 11.7 with four 
degrees of freedom. This is statistically significant at the two per 
cent level. We may therefore reject the hypothesis that the data in 
columns 1 and 2 were produced by Bernoulli trials with a probability 
of one-half. 

Certain states consistently have grown more rapidly than the national 
average; other states consistently have grown less rapidly than the 
national average. A satisfactory theory of growth must explain this 
phenomenon. 


B. Variables Which are Significantly Associated with the Rate of 
Growth of Manufacturing Production Worker Employment. 

Let us focus upon a period between two business cycle peaks, 1919 
to 1953. During this period, the rate of growth of manufacturing 
employment in a state (i) was negatively related to the ratio of manu- 
facturing to total state employment, (ii) was positively related to the 
rate of in-migration into a state, and (iii) was not significantly related 
to the level of the wage prevailing in the state’s manufacturing 
industries. 

1. The aggregate approach. 

The rate of growth of manufacturing production worker employment 
between 1919 and 1953 was computed from the Census of Manufactures 
and the Annual Survey of Manufactures. Call this variable Y. The 
ratio of manufacturing to total state employment, (ratio X,), was com- 
puted for 1940, a census year between the initial and terminal dates. 

The rate of (in)migration by state was calculated over the period 
1920-50. We summed the total migration into the state of people in 
the age group 15-64 during the period 1920-50. This figure was 
divided by the total number of people in the age group 15-64 who 
lived in the state during the period 1920-50.’ Call this resulting variable 
X,. Given the values of X, and X, by state, how significant are they 
in explaining growth rates of employment by states from 1919 to 1953? 

X, and X, are expressed in standard measure and become z}j and 2} 
respectively.’ Y becomes y when expressed in standard measure and 

2 Everett S. Lee, ‘Migration Estimates,’’ in Everett S. Lee, Ann R. Miller, Carol P. 
Brainerd, and Richard A. Easterlin, Population Redistribution and Economic Growth, 
United States, 1870-1950 (Philadelphia: The American Philosophical Society, 1957), Table 
P-1. 

8 X, is the ratio of manufacturing to total state employment multiplied by 1,000. Hence, 
0.2135 became 213.5 which was rounded to 214. In standard measure x; = (Xi — X;)/Si. 
X; is the ratio of in-migration to state population of people of labor force age plus 0.184 
to eliminate negative numbers. The resulting figure was multiplied by 1,000. Hence, 
—0.106 becomes 78. 2, = (Xz — X3)/Ss. 
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y is the value of y predicted from the regression equation. When 
the variables are expressed in standard measure, the regression coef- 
ficients of x{ and x; have the same standard error. A double star 
indicates that the regression coefficient is significantly different from 
zero at the one per cent level. 


(A) y = —0.5942x; + 0.39122; , R = 0.626** . 


The standard error of the regression coefficients is 0.12. We conclude 

that: 

(i) we must reject the null hypothesis that the ratio of manufactur- 
ing to total state employment has no effect upon the growth rate. 
The regression coefficient of x{ is —0.5942 and is significantly dif- 
ferent from zero at the one per cent level; and 

(ii) we must also reject the null hypothesis that the rate of migration 
has no effect upon the growth rate. The regression coefficient of 
x; is 0.3912, which is statistically significant at the one per cent level. 
If we try to predict growth rates by substituting the 1939 level of 

the production workers’ annual earnings‘ for the migration rates, we 

do not meet with success. It is true that migration occurs from low- 
to high-wage states, but the level of the wage has an ambiguous net 

effect upon the growth rate. Call the average annual earnings of a 

production worker in manufacturing X,. Let X, and Y have the same 

meanings as above. 


(B) y = —0.5099x; + 0.1025¢3, R=0.509**. 
** 


The standard error of the regression coefficient is 0.1290. We cannot 
reject the null hypothesis that the level of the wage has no effect 
upon the rate of growth. The regression coefficient 0.1025 is not 
significantly different from zero at the five per cent level. 

2. An industry by state analysis. 

Interstate differences in industrial composition were ignored in the 
immediately preceding section. Insofar as interstate differences in 
industrial composition exist, it is possible that employment in state A 
grows more rapidly than it does in state B; nevertheless, for each 
industry which is located in both states, employment grows more rapidly 
in state B. That is, state A may have a concentration of rapidly 
growing national industries, whereas state B may have a concentration 
of slowly growing national industries. 

When interstate differences in industrial composition are taken into 
account, are the variables X, and X, still significantly related to the 


‘ The year 1939 was a census year and is close to the year in which X; was measured. 
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state’s rate of growth? A hypothesis, subject to test, can be formu- 
lated as follows: 

The probability that manufacturing industry X will grow more 
rapidly (in terms of production worker employment) in state A than 
in the United States as a whole depends: (a) negatively upon the 
ratio of manufacturing to total employment in the state; and (b) 
positively upon the rate of migration into the state. 

Fourteen manufacturing industries and forty-eight states were ex- 
amined between 1947 and 1953. 

The industries were: 


SIC Number Industry 

22 Textile Mill Products 

23 Apparel and Related Products 

25 Furniture and Fixtures 

26 Paper and Allied Products 

27 Printing and Publishing Industries 

28 Chemicals and Allied Products 

30 Rubber Products 

31 Leather and Leather Products 
Primary Metal Industries 

34 Fabricated Metal Products 
Machinery (except Electrical) 

36 Electrical Machinery 

37 Transportation Equipment 

38 Instruments and Related Products. 


Two industries were excluded because they are market oriented, 
and two industries were excluded because they are natural resource 
oriented. In the first category we have SIC Number 20, Food and 
Kindred Products, and SIC Number 28, Chemicals. The heavy in- 
dustrial chemicals such as sulfuric acid cannot be transported very 
far. There are also many chemical products, such as the by-products 
of refineries, which are natural resource oriented. In the. second 
category we have SIC Number 21, Tobacco Products, and SIC Number 
24 Lumber and Wood Products (except furniture), which must be 
located near forests. 

The rate of growth of production worker employment in each in- 
dustry, in a given state, was compared with the rate of growth of 
production worker employment in the same industry in the United 
States as a whole. If the industry grew more rapidly (in terms of 
production worker employment) in a given state than it did in the 
United States, we score a success. Otherwise, we score a failure. 
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The forty-eight states were ranked in terms of the ratio of manu- 
facturing to state employment in 1950. Row 1 contains the top third 
of manufacturing states, row 2 contains the middle third of manu- 
facturing states, and row 3 contains the bottom third. In column 1 
we place the states’ which had net in-migration of civilian population 
from 1945 to 1954; in column 2 we place the states which had net 
out-migration of civilian population during this period. The following 
3 x 2 table was then obtained. It is important to note that labor 
migrates from non-manufacturing to manufacturing states. Hence, 
extreme caution must be used in interpreting this table. 

On the whole 281 industries and states were covered, of which 130 
grew more rapidly than their national counterparts. Thus, the proba- 
bility of success in the entire sample is 0.4626. The hypothesis subject 
to test claims that the probability of success is negatively related to 
the ratio of manufacturing to total employment. As we descend from 
row 1 to row 2 to row 8, the probability of success is expected to 
increase. 

This, in fact, occurs, whether we look at the states with net in- 
migration or with net out-migration. Among the states which had 
net in-migration, the probability of success increased from 0.2824 to 
0.6071 to 0.8571 as the ratio of manufacturing to total state employ- 
ment decreased. Similarly, among the states which experienced net 
out-migration, the probability of success increased from 0.50 to 0.5769 
to 0.7857 as the ratio of manufacturing to total state employment 
decreased. 

A test of statistical significance was applied to the rows. The 
probability of success in the entire sample is 0.4626. We may ask 
whether the differences in probabilities of success among the rows 
arise solely from random sampling out of a common population with 
a probability of success of 0.4626. 

Row 1 contains 147 observations; row 2 contains 106 observations; 
and row 3 contains 28 observations. If the rows merely represent 
random samples from a population with a probability of success of 
0.4626, then we should expect 68, 49, and 13 successes in rows 1, 2, 
and 3 respectively. Chi-square is 18.92** with two degrees of freedom. 
This is significantly different from zero at the one per cent level. We 
may reject the hypothesis that rows 1, 2, and 3 are random samples 
drawn from a common population. 

6 Several states were not included in the table below due to the paucity of data, result- 


ing from the disclosure rule. 
6 In regression (A) above rj. was positive though not significant at the five per cent level. 
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TABLE 2 
PROBABILITIES OF GROWING MORE RAPIDLY THAN THE NATIONAL AVERAGE, 1947-53* 
Net In-migration n s Net Out-migration ns 
Top third New Hampshire 3 1 Maine 4 1 
in terms Massachusetts 13 4 Rhode Island 5 2 
of ratio Connecticut 11 2 North Carolina 4 $ 
of manu- New York 14 4 South Carolina a 
facturing New Jersey 13 5 i660 8 — 
to total em-| Pennsylvania 14 4 Piz = 0.5000 
ployment Ohio 13 4 = 45 
Maryland 9 4 va 
Illinois 12 0 p = 0.3061 
Michigan 12 5 
Wisconsin 12 2 
Delaware 5 m3 
Isl 37 
Pu = 0.2824 
Middle Minnesota 4 1 Vermont 2 1 
third Maryland 1i 5 Iowa 6 4 
Oregon 4 2 Missouri 10 5 
California 9 9 Virginia 9 5 
os M7 West Virginia 5 1 = 106 
Pn = 0.6071 Georgia 7 4 — 
Kentucky 7 4 8 = 62 
Tennessee 9 4 
Alabama 6 4 p = 0.5849 
Arkansas 6 5 
Louisiana 6 4 
Washington eo 
78 45 
Pn = 0.5769 
Bottom | Florida 2 2 | Nebraska 2 0 
third | Texas 10 9 Kansas 4 4 28 
Colorado ; 8 Mississippi S..-f% ~ oem 
| Arizona 1 1 Oklahoma 3 2 s= 23 
Té T2 | Utah ire 
ps: = 0.8571 i ti | »= 9-84 
Psa = 0.7857 
| »=173 n = 108 n = 281 
| s = 66 a= 64 3 = 130 
| p = 0.3815 p = 0.5926 p = 0.4626 











Source: For growth data, Census of Manufactures, 1947 and Annual Survey of Manu- 
factures, 1953. For data on the ratio of manufacturing to total employment, Statistical 
Abstract, 1954. For migration, U. S. Bureau of the Census, Current Population Reports, 
Population Estimates, Series P-25, No. 124, Oct. 24, 1955, p. 7. 


* n - number of cases, s = number of successes. 


The hypothesis also claims that the probability of success is positively 
related to the rate of migration. States having net in-migration are 
expected to have a higher probability of success than states having 
net out-migration. The data presented in Table 1 are not conclusive, 
because the rate of migration and the ratio of manufacturing to 
total employment are positively correlated. Except for row 1, the 
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probability of success is greater in column 1 than in column 2. Given 
the ratio of manufacturing employment, states receiving net in-migra- 
tion did better than states having net out-migration in two out of 
three cases. The states in row 1, column 1 are heavily industrialized 
compared to North Carolina, South Carolina, and Maine in row 1, 
column 2. Hence, the poorer performance of states in row 1, column 
1 compared to the states in row 1, column 2 may merely reflect the 
dominant effect of industrialization (i.e., the ratio of employment in 
manufacturing) in inhibiting growth. The multiple regression analysis 
used in Section 1 above is a finer test of the hypothesis than the 
probability test used here, because of the correlation between migration 
and industrialization. 


2. A REJECTION OF SIMPLISTIC RATIONALIZATIONS 


It would seem that the above empirical findings could be rationalized 
with a little commonsense reasoning and a little economic theory. 

(A) Simplistic view: Of course, states with low proportions in 
manufacturing grow fast in manufacturing employment. Those are 
the states where there is room for expansion: the percentage growth 
from a small base looks big. 

Manufacturing employment grew from M, to M,, yielding a growth 
rate of AM/M,. If AM were randomly distributed among states, then 
it is clear that AM/M, would be greater among states with low M, 
than among states with high M,. This is the only rationale I know 
for the argument that high growth rates are to be expected from states 
with low bases. It was observed that AM/M, was negatively associated 
with M,/L, (the ratio in manufacturing at an intermediate period). 
Hence, the simplistic view maintains that M,/L, is positively correlated 
with M,. (L, is total state employment.) 

I tested the simplistic hypothesis directly: namely, that the differences 
in growth rates are attributable to variations in M,. A statistical 
test was made of the null hypothesis: the initial values of M, are not 
different between states with high AM/M, and states with low AM/M,. 
This test attempts to determine whether the high growth states started 
from lower bases than did the low growth states. 

The states were divided into two groups, the states which had above 
median rates of growth of manufacturing production worker employ- 
ment from 1919 to 1953, and those which did not. The initial amounts 
of manufacturing production worker employment were listed by state, 
and a test for the difference between two means was performed. To 
be sure, the mean manufacturing production worker employment in 
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1919 was lower among the high growth states than among the low 
growth states, but this difference is not significant at the five per 
cent level. Hence, we may not reject the null hypothesis that the 
1919 levels of employment in the two sets.of states were drawn from 
the same population. This effectively rejects the argument that the 
differential growth rates were produced merely by the different bases. 

(B) Simolistic view: One would expect growth and in-migration 
to be positively associated. After all, in-migration is evidence, in it- 
self, of good manufacturing job opportunities. People go to the jobs; 
hence, employment growth looks biggest where people are going in 
largest numbers. 

The surprising thing is that the rate of in-migration (X,) and the rate 
of growth (Y) are not correlated at the zero order. The total correla- 
tion between X, and Y is 0.2464, which is not significantly different 
from zero at the five per cent level. It is by no means obvious that 
the rate of migration is positively associated with the rate of growth. 

Only in a partial correlation does the influence of migration upon 
growth manifest itself. The regression coefficient of x} (in regression 
(A), above) is statistically significant at the one per cent level. When 
the influence of X, (the ratio of manufacturing employment) is taken 
account of, then the rate of migration and the rate of growth are 
positively and significantly associated. 

One must explain why the total correlation r(Y, X,) is not significant 
whereas the partial correlation r(Y, X,-X,) is significant. The ex- 
planation must be based upon economics and not solely upon the 
truism that (X,, X,) are correlated. 

Moreover, one must explain why the wage level (X;) is not significant- 
ly associated with the rate of growth (Y). A simplistic theory fails 
to do these things. 


8. A THEORY OF GROWTH 


The inability of the simplistic rationalizations to explain the above 
phenomena necessitates the construction of a theory of growth which 
can explain the empirical regularities described in Part 1 above. 

Every theory is a simplified description of reality. Assumptions are 
made which may not be exact descriptions of reality, in order to ex- 
pose the basic forces which operate to produce the phenomena to be 
explained. The introduction of ‘‘more realistic’? assumptions may 
merely complicate the analysis and prevent us from understanding the 
nature of the forces at work.’ The art of theorizing consists of find- 


7 See Milton Friedman, Essays in Positive Economics (Chicago, 1953), ch. L. 
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ing the crucial variables that produce the phenomena to be explained 
and not getting involved with a myriad of variables which are of 
secondary importance. The crucial tests of a theory are (i) whether 
its predictions are compatible with the data to be explained and (ii) 
whether it helps us to understand the workings of our economic 
system. The theory developed here should be evaluated in terms of 
its ability to explain’ the material in Part 1 above, and not in terms 
of the verisimilitude of the assumptions made.’ 


A. Assumptions. 

1. Assume that the f.o.b. price of a product produced by a given 
manufacturing industry is the same for each firm, regardless of its 
regional location. Transport cost varies directly with the distance of 
the consumer from the factory concerned. The transport cost per mile 
may differ among firms. Consumers are assumed to buy from the 
firm for which the total price is cheapest. As Schneider” has shown, 
well defined market areas exist for the various firms, under these as- 
sumptions. 

Moreover, let us assume that the producers in any single state do 
not significantly affect the price of a given product by varying the 
state’s rate of output. This assumption is convenient, but not crucial, 
for analysis. Let the f.o.b. price of output, p, be determined by 
national supply and demand conditions. The i.o.b. price p grows at 
p* per cent per year. 

2. The price of capital goods to each firm in a given manufacturing 
industry is equal to Z = Z, + ts, where Z, is the uniform f.o.b. price 
and s is the distance from the producer. The variable ¢t is transport 
cost per mile. The variable ts is assumed to have the same density 
function in each state for firms in a given manufacturing industry. 
Hence, the average c.i.f. price of capital goods, to firms in a given 
industry, will be the same for producers in each state. 

Price Z, is determined by national supply and demand factors; the 
c.i.f. price grows by Z* per cent for the average firm in each state, 
in a given industry. 

8. Production functions of firms in a given manufacturing industry 
are assumed to be identical among states and subject to constant 
returns to scale. The uniformity of production functions is based on 

8 There is a formal equivalence between an explanation and a prediction. See Carl G. 
Hempel and Paul Oppenheim, ‘‘Studies in the Logic of Explanation,’’ Philosophy of Science, 
Vol. 15, No. 2 (April, 1948). 

® See Friedman, op. cit. 

W E. Schneider, ‘‘Bemerkungen zu einer Theorie der Raumwirtschaft,’”’ Econometrica, 
1935; R. G. D. Allen, Mathematical Analysis for Economists (London, 1949), pp. 80-82 
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the view that interstate differences in soil fertility and climate are 
not very important in affecting manufacturing output, given the in- 
puts of labor and capital. 

Since we assumed that there are constant returns to scale for each 
firm, we must prevent a single firm from monopolizing the industry. 
Let us impose a limit to the size of each firm such that competition 
is allowed to prevail. Entry into this industry is free. 

4. Finally, assume that pure competition prevails and firms attempt 
to maximize profits. 

The assumptions enable us to expose the fundamental role of labor 
supply conditions in generating interstate differences in the rate of 
growth of manufacturing employment. The usefulness of the model 
and assumptions are examined in terms of the explanatory value of 
the model. 


B. Analysis. 

1. Every firm in a given manufacturing industry will select that 
ratio of capital to labor Q, such that the value of the marginal product 
of capital goods is equal to the price of capital goods. If f is the 
marginal-physical-product-of-capital function, p is the f.o.b. price of 
output, and Z is the c.i.f. price of capital goods, then 


(1) {Q=Zip, f'<o0. 


Ic was assumed that each firm in the industry has the same f, 
which is a function solely of Q. In addition, p was assumed to be the 
same for all firms regardless of their regional location. Although the 
c.i.f. price of Z varies according to the purchaser’s distance from the 
factory, the average value of Z will be the same from state to state 
in the given manufacturing industry. Consequently, in long-run 
equilibrium, Q will be the same for firms in the given industry re- 
gardless of their regional location. 

Given the long-run equilibrium value of Q and the price of the product 
p, the marginal-physical-product-cf-labor function F determines the 
equilibrium wage, equal to the value of the marginal product of labor: 


(2) pF(Q)=w, F’>0. 


Insofar as p, F, and Q are, on the average, the same in each state 
in the given industry, the long-run equilibrium wage will be uniform 
among states. 

Since the long-run equilibrium wage is determined, the labor supply 
function determines the quantity of labor that will be forthcoming 
and employed in each state. 
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(3) w=+gL), 
Vv 


where v is the shift factor and L is the quantity of labor supplied. 
When v rises, the labor supply schedule shifts to the right, and when 
v falls, the labor supply schedule shifts to the left. 

We solve for the state rate of growth of employment in the given 
industry, L* = L/L, in terms of three parameters. The parameters 
are the rate of growth of the product price, p* = p/p; the rate of 
growth of the average c.i.f. price of capital goods, Z* = Z/Z; and the 
shift of the state labor supply function, v* = #/v. Equations (1.1), 
(2.1) and (3.1) are derived by differentiating the logarithms of equations 
(1), (2) and (3) with respect to time: 








1.1 Z*— n* = F'(QQ A 
— oa 

s -. Soe 
2.1 — p* = 
(2.1) w p FQ) 
3.1 $9 = LOL 
= Pits g(L) 


Define the following four terms. 

(a) u=d log Q/d log f(Q) = {f(Q)/F'(@)}(1/Q). This is the elasticity 
of the marginal-product-of-capital schedule with respect to Q and is 
negative. 

(b) e = d log Q/d log F(Q) = {F(Q)/F’'(Q)}(1/Q). This is the elastic- 
ity of the marginal product of labor schedule with respect to Q and 
is positive. 

(c) s = {g(L)/g’(L)}(1/L). This is the elasticity of labor supply. 

(d) k* = Z* — p*. This is the growth of the ‘‘real’’ price of capital 
goods. 

A substitution of (a)-(d) into (1.1)-(3.1) yields 


(4) L* = s( p* re Kr) + svi, 


where 7 refers to the ith state. 

The rates of growth of the product price p* and the real price of 
capital k* are assumed to be the same in all states in a given manu- 
facturing industry. Insofar as the production function is identical 
among firms in this industry, u/e is the same among states. Suppose 
that the elasticity of labor supply were uniform among states. Then, 


(5) Lt 2 {p* + (uje)k*} + vf : 
Le = {p* + (u/e)k*} + oF 
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Interstate differences in rates of growth of employment in a given 
manufacturing industry, from one long-run equilibrium to another, 
arise solely from interstate differences in the growth of the labor 
supply function. 

2. It is well known that interstate differences exist in the average 
wage paid in given manufacturing industries. According to our model, 
this means that firms are not at their long-run equilibrium positions." 
Firms in low-wage states will yield higher marginal products of capital 
than firms (in the same industries) located in high-wage states. The 
length of time required to bring the wage up to the equilibrium value 
depends upon the speed at which capitai flows into low-wage areas to 
benefit from the higher marginal product of capital. 

Suppose that the equilibrium wage rose by y = p* + (u/e)k* per 
cent from (say) 1919 to 1953. If the wage in the low-wage state rose 
to the new equilibrium, then the per cent increase in the wage in the 
lew-wage state would be d(l+y)+y per cent.” The variable d 
reflects the per cent difference between the equilibrium and the initial 
wage. In actuality, the wage may only rise by x per cent of vhat 
amount, due to the insufficiency of the capital inflow that occurred 
during the period. Consequently, the relative rate of growth of em- 
ployment in the low-wage state to the high-wage state, in a given 
manufacturing industry, is” 


(6) LP/LY = [y + zd + y) + of /[y + v7] . 


z is between zero and unity and reflects the extent to which the 

disequilibrium has been decreased. 
It thus appears that the rate of growth of employment, in a given 

state in a given manufacturing industry, depends upon: 

(i) the per cent increase in the equilibrium wage, y, which is the 
same for each firm in the given manufacturing industry, 

(ii) the rate at which the labor supply schedule is shifting to the 
right in a given state, v*, 

(iii) the extent to which the wage was initially below the equilibrium 


4 According to our model w= pF(Q). Since p, F, and Q are the same for firms in 
each state, long-run equilibrium implies wage equalization. Since F’ >0, given Q and 
p, a unique w results. 

12 Let wo be the initial equilibrium wage and wo’ be the new equilibrium wage, in both 
states. State 1 is the low-wage state where w, was initially below wo, but state 2 initially 
paid the equilibrium wage, i.e., wz=we. Suppose that capital flows into both states and 
wage rates rise to wp’. 

In state 1, wage will rise by (wo’ — w,)/w, and in state 2, wage will rise by (wo’ — wo)/wo. 
(wo' — wi)/w; = {(wo’ — wo) + (wo — wi)}/wi. Let (wo’ — wo)/wo=y and (we — w:)/w; = d. 
Then, (wo’ — w,)/w, = d(l + y+ y.- 

% Assume that 8; = 85. 
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wage, i.e., d, and 
(iv) the extent that the disequilibrium is eliminated, i.e., 2. 

Low-wage states, on the basis of (iii), would tend to have higher 
rates of growth of employment than high-wage states in the same 
industries, provided that migration did not occur from low- to high- 
wage states. 

We shall now complete our model by discussing the determinants of 
v*, the rate at which the labor supply curve shifts. There is evidence 
that out-migration occurs from low-wage to high-wage states. Asa 
result, the low-wage states, on balance, need not have higher rates 
of growth of employment than high-wage states. 


C. The Determinants of the Shift of the Labor Supply Curve. 

The basic conclusion of the model is that interstate variations in 
rates of growth of manufacturing employment in a given industry 
stem from differences in the shift (v*) of the labor supply schedule 
and from deviations (d) between the initial wage and the initial equi- 
librim wage. Interstate differences in the elasticity of labor supply (s) 
facing a given industry are ignored. Attention will now be focused 
solely upon interstate differences in v*. 

The shift, or growth, of the labor supply curve facing the mznufac- 
turing sector of a given state arises from several sources. 

1. Intra-state migration from non-manufacturing employment to 
manufacturing employment will cause the labor supply schedule facing 
the manufacturing sector to shift. 

It is well known that (1) the rate of natural increase of the labor 
force is greater in rural than in urban areas. Therefore, the rate of 
natural increase of the labor force is negatively associated with the 
degree of urbanization in a state. However, (2) the degree of urbani- 
zation is positively related to the per cent of the labor force employed 
in manufacturing. It follows that the rate of natural increase of the 
labor force is negatively related to the per cent of the labor force 
employed in manufacturing. The smaller the ratio of manufacturing 
to total state employment, the greater the potentialities for the growth 
of the manufacturing labor supply schedule through a migration of 
labor from non-manufacturing to manufacturing employment. In effect 
the ratio of non-manufacturing to total employment is used as a proxy 
for the labor force replacement rate,” i.e., the rate of natural increase 


4 The replacement ratio is defined as the ratio of the expected number of entrants into 
the aye group 25-69 during a decade per 100 expected number of departures from this 
age group during the same decade through death or attaining retirement age. The re- 
placement ratio was not used for the states because of the extensive calculation involved. 
In an extension of this study, these calculations will be made. 
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of the labor force. 

Whether or not the labor force will shift from the non-manufactur- 
ing sector into the manufacturing sector depends upon the relative 
attractiveness of the manufacturing and non-manufacturing sectors. 
For example, a state may have 10 per cent of its employment in 
manufacturing, but the average income in manufacturing may be lower 
than the average (labor) income received in any other sector. The 
incentive for the intra-state shift from the non-manufacturing sector 
to the manufacturing sector would then be negative. Labor would 
tend to flow out of manufacturing into the higher average earning 
non-manufacturing sectors. The factors which determine the magnitude 
of the shift from the non-manufacturing to the manufacturing sector 
of a given state are: (a) the relative size of the non-manufacturing 
sectors as a proxy for the rate of natural increase of the labor force 
and (b) the relative attractiveness of the manufacturing sector com- 
pared to the non-manufacturing sectors. 

The attractiveness of the manufacturing sector, for production 
worker employment, may be measured by the value of the average 
production worker’s annual earnings in manufacturing relative to state 
per capita income. The average production worker’s annual earnings 
represent the most likely amount that a production worker can earn 
in the manufacturing sector. This figure takes into account the aver- 
age hourly earnings among the different manufacturing industries, the 
state composition of manufacturing industries, the opportunities for 
overtime work, and the steadiness of employment. State per capita 
income reflects the income received from all sources. The higher the 
ratio of the average production worker annual earnings to state per 
capita income, the greater the incentive for the labor force to shift 
into the manufacturing sector. 

As a matter of historical evidence, the ratio of average yearly 
production worker earnings to state per capita income is negatively 
correlated with the ratio of manufacturing to total employment. In 
1940, the correlation between these two variables was —0.65, among 
the forty-eight states. This is significantly different from zerc at the 
one per cent level. Thus, the incentive to shift into the manufactur- 
ing sector appears to decrease as the ratio of manufacturing to total 
employment increases. 

2. A second major source for v* is interstate migration. The 
greater the rate of in-migration of people in the labor force age, as 
a percentage of the state labor force, the greater will be the right- 
ward shift of the labor supply curve facing the manufacturing sector. 
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Migration occurs, to a large extent, from low-wage to high-wage 
states.” Hence, v* is affected by the ratio of the wage in one state 
to the average wage prevailing in the other states. 

Ir summary, the shift of the labor supply curve facing the manu- 
facturing sector of a given state (v*) is affected by four variables: 

(a) the natural rate of increase, for which the ratio of non-manu- 
facturing to total state employment is a proxy; 

(b) the relative attractiveness of the manufacturing sector to the 
other sectors within the state; 

(c) the rate of in-migration, which is affected by the ratio of the 
wage in this state to that prevailing in other states; 

(d) non-economic factors, such as climate. 


4. THE EXPLANATORY POWER OF THE THEORY 


The theory implies the phenomena described in Part 1 above, and 
may therefore be considered an explanation of interstate differences 
in growth rates. 

1. The theory implies that the rate of growth of manufacturing 
production worker employment, in a given manufacturing industry, 
will be higher in states with low ratios of manufacturing to total em- 
ployment than in states with high ratios. This is compatible with the 
results of regression (A) and Table 2. 

2. The theory implies that, when the manufacturing ratio is taken 
into account, the rate of in-migration will be positively associated 
with the rate of growth. This actually occurs, as is shown by regres- 
sion (A) above. It is likely that the partial correlation of migration 
and growth reflects the fact that workers migrate from low-wage to 
high-wage areas. Consequently, the v* in low-wage areas is less than 
one would expect on the basis of the rate of natural increase in the 
labor force; v* in high-wage areas is greater than one would expect 
on the basis of the rate of natural increase. 

8. The rate of growth of manufacturing production worker employ- 
ment, in a given manufacturing industry, will not necessarily be higher 
in low-wage states than in high-wage states. The level of the state 
wage has two counterbalancing effects upon the rate of growth of 
employment. 

(a) The marginal rate of return on capital is higher in low-wage 
states, in a given industry, than in high-wage states. Capital is there- 

18 See the work of Richard Muth and Harvey Perloff, Variation in Levels and Rates 
of Growth of Per Capita Income, Part IV of ‘‘Regional Growth in the United States,” 
ch. 25 (Resources for the Future, 1959). 
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fore attracted to the state. As capital flows in, this generates a 
demand for labor. Thus, wages and growth rates should be correlated 
negatively. 

(b) On the other hand, out-migration of labor occurs in low-wage 
states as labor flows from low- to high-wage states. As a result, 
out-migration tends to reduce v* (the shift of the labor supply schedule 
facing manufacturing) in low-wage states. In high-wage states, in- 
migration of labor from low-wage states tends to increase v*. This 
implication is compatible with the finding of regression (B), where the 
regression coefficient of 2; was not significant. 

4. The stability of relative growth rates, in terms of Table 1, 
is partially explained in terms of the invariance of the rankings of 
states by the per cent of employment in manufacturing. The com- 
position of the ten states with the greatest ratios of manufacturing 
employment was practically unchanged from 1920 to 1940. Similarly, 
the composition of the ten states with the lowest ratios of mannfac- 
turing employment was practically unchanged during this period. The 
former group is dominated by states with no more than two successes 
(cf. Part 1), whereas there is no more than one state in the latter 
group which had fewer than four successes (cf. Part 1). 


Brown University, U.S.A. 
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BALANCED GROWTH OF FIRMS IN A COMPETITIVE 
SITUATION WITH EXTERNAL ECONOMIES* 


By MICHIO MORISHIMA AND GERALD L. THOMPSON? 


THE ASSUMPTION of independent production possibilities underlying 
the traditional general equilibrium theory implies that external econ- 
omies and diseconomies do not prevail. More generally, however, in 
the presence of such economies and diseconomies, the level of activity 
in the individual firm affects the production possibilities of other firms 
through its influence on the general level of activity in ways other 
than through possible effects on the prices of goods and factors; that 
is to say, the production possibilities of a firm depend on the general 
development of industries. It is, therefore, very important to examine 
various effects of external economies and diseconomies on the develop- 
ment of firms.’ In particular, it is interesting to compare the balanced 
rate of growth and the consumption level in a competitive situation 
in which such economies exist with the corresponding magnitudes 
in a situation in which these economies are absent. In the context of 
the model developed in this paper, we demonstrate (and this may be 
intuitively apparent) that the balanced rate of growth is higher in the 
former than in the latter case if the consumption level is the same, and 
that the highest allowable level of consumption is higher in the former 
than in the latter case if the rate of balanced growth is the same. 

In the first section we describe a model in which external economies 
or diseconomies of the ‘‘unpaid-factor’’ type (in the sense of Professor 
Meade [5]*) prevail. This is a mere generalization of the von Neumann 
model [2, 7] of balanced growth, which assumes the constancy of the 
technological coefficients, i.e., the absence of any external economies 

* Manuscript received September 15, 1959. 

1 This paper is based on Thompson’s ‘‘A Generalization of the von Neumann Dynamic 
Model”’ (mimeographed), the research on which was supported by the Office of Naval 
Research, under contract with the Econometric Research Project of Princeton University. 
We express our gratitude to Professors Kumagai and Nikaid6é of Osaka University, Mr. 
R. J. Ball of the Wharton School of Finance and Commerce, University of Pennsylvania, 
and the referee of this paper, who made valuable suggestions and criticisms. 

2 The question of the relation between external economies and Pareto optimality is not 
discussed here, although it is surely of some interest. Under the assumption of no 
external economies E. Malinvaud [4] studied the correspondence between maximal and 


efficient balanced growth programs. 
8 Numbers in brackets refer to the list of references at the end. 
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or diseconomies. In the second section we are concerned with the 
existence of balanced growth solutions and show that, even if external 
economies and diseconomies exist, all firms may grow or decay propor- 
tionately. The less mathematically inclined reader may omit this 
section —— though the existence result is, of course, crucial and 
fundamental in the discussion of mathematical economics. In the third 
section we show how external economies react upon the rate of balanced 
growth of firms, and in the last section we compare the consumption 
level in the presence and absence of external economies. 


1. THE ECONOMIC MODEL 


We consider an economy in which there appear m firms (subscript 
i or r) and m goods (subscript 7 or s). It is assumed that there are a 
large number of competing firms, so that for each individua! entrepreneur 
the market prices of the products and the factors are given. Let 

d,{t) be the quantity of good j produced by firm i in period ¢, 

e,{t) be the quantity of good j used up by firm 7 in period ¢, 

qt) be the number of workers‘ employed by firm i in period ¢. 
Define the following vectors: 


d,(t) = [d,,(t), vig d,,(t)] , 
eft) = [€,,(t), +++, n(t)] , 
d(t) - [d,,(t), ora d,,(t), aS» d,,,(t), steak dan(t)] ’ 
eft) = [,(t), +++, Crn(t), ***s Cult), ***, Cunle)] 
g(t) = [¢,(t), +++, Gn(t)] . 
For the individual firm, the traditional production function may be 
written implicitly, 


Fid{t), et), g{H] = 0. 


This assumes that inputs are transformed instantaneously into outputs, 
i.e., the period of production is zero. If we assume a unit production 
period, current outputs will depend on the inputs of the previous 
period, so we have 


gid (t +1), e(t), a()) =0. 


This is the case in the absence o1 external economies and diseconomies. 
In their presence, however, the current output of the ith firm will 
depend also on the activities of all firms. It is plausible to suppose 
that it will take time to diffuse the effects of the general development 


* This paper assumes that labor is indispensable. This rules out the possibility of 
automatic factories. 
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of industries throughout the economy. Consequently, we postulate 
that the level of current output of the ith firm depends not on the 
current activities of all firms but on their activities in the previous 
period. It follows that the production function of the individual firm 
can be rewritten, 


d{d(t +1), e(t), a(t); dt), et — 1), a(t -—1)] =0. 


Professor Meade did not include th’s time lag, but assumed that the 
current activity of a firm would instantaneously affect the production 
possibilities of other firms. 

On the other hand. we follow Professor Meade in distinguishing 
two different cases concerning the production function. For this, we 
write ¢, in explicit form as: 


d,(t +1) = w,[d,{t + 1), «++, d,,(t + 1), t) (t); 
d(t), et -1), g(t —1)). 


The arguments which firm 7 can contro! in period t are d,(t +1), e,(t), 
and q,(t) only. Of course, it can contro! d,(t), e(t — 1), and q,(t — 1) 
{which are included in the vectors d(t}, e(t—1), and q(t—1), respectively] 
in period ¢ — 1 but not in period i, so that d(t), et —1), and q(t — 1) 
play the role of parameters in the above function. We now distinguish 
between: (1) those cases in which d,,(t + 1) is homogeneous of degree 
one in d,({t +1), +++, dit +1), e(t), g,(t), dit), et — 1), g(t — 1); and 
(2) those cases in which d,,(¢ +1) is homogeneous of degree one in 
d,(t +1), «++, d(t+1), e(t), and gq,(t), but not in all arguments 
including d(t), et —1), and q(t —1). Meade calls the first case the 
‘‘unpaid-factor’’ case and the second the case of the ‘‘creation of 
atmosphere.’ In this paper, we are concerned with the former case 
only. 

Let p(t) be the price of good 7 in period /, w(t) the money vage- 
rate in period t, and ot) the interest factor in the same period (=1-+- 
the rate of interest in period t). All of them are given to each firm 
as determined in the market. Let p‘(t +1) be the price of good 7 in 
period t +1 which is expected by firm 7 in period ¢. Suppose firm 1 
has a given number of workers. q,(t), available for production in period 
t. Then the firm chooses the optimum factor-product combination, such 
that the capitalized value of profits 

1 rn : “~ ” 
1S p(t + 1)d,(t + 1)| — dS pte,(t) — w(te,(t) 
A(t)ui= Jt 41 
is a2 maximum subject to the production function ¢, This schedule 
5 See [5] p. 56 and p. 67 
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gives d,(t +1) and e,,(t) as functions of p*(t + 1), p(t), A(t), g.(t), a(t), 
et —1), and g(t—1). Now assume static expectations such that 
p(t+1)= p(t). Since d,(t +1) and e,,(t) are homogeneous functions 
of degree zero in p*(t + 1) and p(t), the functions under the assumption 
of static expectations can be written as: 
df(t + 1) =fi.[y(t), A(t), a(t), dd), et — 1), g(¢—1)], 
e,(t) = g.y(t), A(t), a(t), a(t), et — 1), a(t —1)], 


where y(t) = [y,(t), «++, yn(t)], and y(t) = pit) pt). Furthermore, 


from the assumption that ¢, is homogeneous in d,t+ 1), e,(t), 9,(t), 
d(t), et — 1), and g(t — 1), we find that d,,(¢ + 1) and e,,(t) determined 
above are homogeneous of degree one in q,(t), d(t), e(¢ — 1), and g(t — 1); 
thus 


Ad, (t + 1) = f,[y(t), S(t), at), Ad(t), re(t — 1), Aa(t — 1)] 
and 
re, (t) = g.ly(t), A(t), rat), Ad(t), re(t — 1), Ag(é — 1)) 
hold for any value of scalar X. Writing 
d,ft + lj/g(t) =5,(t+1), 
€.<t)iqdt) = et) , 
gt + 1)/9,(t) = a(t), 
at)/S alt) = 2,0), 
and putting \ = 1/¢,(t — 1), we obtain: 
Md, ft + 1) = df(t + L/gd(t — 1) = aft — 1)6,(t +1), 
re, f(t) = e,(t)/9,(t — 1) = a(t — 1)e,{t) , 
Ad, (t) = d,(t)/q(t — 1) = b,,(t)a{t — 1)/a(t —1), 
re, ft — 1) = eft — L/g(t — 1) = e,,(t — 1)aft — 1)/2,(t — 1), 
AGAt — 1) = g(t — I/a(t — 1) = aft — 1)/x(t —1). 
Hence we find: 
a(t —1)6,(t+ 1) =f,,{y(), A(t), a(t — 1), b(t), e(¢ — 1), x(t — 1)], 
a(t — 1)e,,(t) = g,,[y(t), A(t), a(t — 1), 6(t), e(t — 1), a(t — 1], 


where b(t) - [.,(¢), ee ban(t)], c(t — 1lj= [e,,(¢ —1), +, Cuan(t — 1)], and 
a(t — 1) = [x(t — 1), «++, 2,,(¢ — 1)]. 

Let us now concentrate our attention on the state of balanced growth 
where q,(t), «++, @,(t) grow or decay in the same proportion a, and 





BALANCED GROWTH OF FIRMS 133 


all the prices y, the growth factor a(=1+ the rate of balanced growth), 
the interest factor 8, and the technological coefficients b and ¢ are 
constant over time. In that state, 


a(t)=-+--=a,(t)=a; AD=8f, 
atj=x, yt)=y, Wt)=b, c(t)=c 
hold for all ¢. It is known that under some suitable conditions there 
exists one and only one system of continuous functions 
bs = 5, (x, Y;, a, 8) ’ 
Cys = C, (2, y; a, 8) , 
which satisfies the equations ab,, — f,,{y, 8, a, 6, ¢, x) = 0 and ac, — 
ady, 8, a, 6, c, e) = 0 (i =1,+++, m; fj =1,---, n).° Note that the 


row vector’ 2 = (x,,+++, z,) with 2,20, }>#,=1 and the column 


re=l 


vector y = (y,, +++, yn) with y, 20, }>y, =1 denote the allocation of 


labor among firms and the normalized prices of goods, respectively. 
Next we assume that capitalists only save® and workers only consume. 
Let ht) be workers’ consumption of good j in period t. Then we have 


w(t) Sault) | = waltyha(t) + +++ + paltValt) 


or 
w(t) = p,(t)k,(t) $f coe + Pa(t)k,(t) ’ 


where k(t) = h,(t)/ > q(t). According to the usual theory of consumer 


behavior, workers’ demand for consumption goods allows substitution 
in response to price changes. In this paper, however, we assume that 
k(t), «++, k,(t) change proportionately, i.e., k(t) = yk, (fj =1, +++, n), 
where k,, +++, k, are constant. The proportionality factor 7 is referred 


to as the level of consumption. Writing w(t) = w(t) |< pt), we obtain 


w(t) = yk,y,(t) + °o° > Yen n(t) ° 


In the following two sections, y is assumed constant and treated as 
a@ parameter in our system. The determination of the consumption 
level y will be discussed in the last section of this paper. 


® See [9] pp. 16-17. 

1 In our model, the vector z plays the same role as the activity vector z in the usual 
von Neumann models. Our vector z is referred to as the labor allocation vector. 

& The case in which capitalists’ average propensity. to save is less than one is discussed 
in [6]. 
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Let us now write: 
a(x, y, &, B, Y=afz, y, a, 8) + vk, 
A(z, y, a, 8, Y) =|l|afz, y, a, B, I, 
Bix, y, a, 8) = || b,(z, y, a, 8)|I. 


In order for all firms to expand at a constant rate a, the following 
cond tions must be satisfied: 


(1) a[{B(x, y, a, 8) — aA(z, y, a, 8, y)]) 29, 
(2) a[B(xz, y, a, 8)— aA(z, y, a, B, rly =9; 
(3) [B(x, y, a, 8)— BA(x, y, a 2, Vly SO, 
(4) a[B(xz, y, a, 8) — SA(zx, y, a, B, yy =0, 
(5) rB(xz, y, a, Syy>O. 


These conditions have the same interpretations as the analogous ones 
in the typica! von Neumann models [2, 7], namely, (1) implies that it 
is impossible to consume more of a good than is being produced, (2) 
makes overproduced goods free, (3) makes firms profitless, (4) assures 
that unprofitable firms do not employ workers, and (5) requires that 
pricing and production schemes are such that something of value is 
produced. 

To give a game-theoretic interpretation to these conditions let us 
imagine two fictitious economic persons whom we shall call the producer 
and the pricer. The producer controls the labor allocation vector x 
and actually is made up of many people;’ in fact, all the entrepreneurs 
who are involved in production decisions in the economy make up, 
jointly, the producer. The pricer controls the price vector y and is 
also made up of many people, that is, all the people who participate 
in pricing decisions. As we shall see in the next section, it is possible 
to interpret the actions of the pricer and producer as being those of 
participants in a zero-sum two-person game. In fact, if the possible 
actions of these two players were to be spelled out in detail, one would 
obtain the extensive form of an ‘‘economic game’’ for which expres- 
sions (1)-(5) are the normalized form (cf. [3] or [8)}). 


2. THE EXISTENCE PROOF 


As in [2], we need not expect equilibria to exist without making 
assumptions concerning A and B. The assumptions to be made here 
are the following: 


® The device of treating many people having common interests as a single player is 
common in game theory ‘cf. [8]). Thus, bridge is considered a two-person game in 
which a player consists of two vartners (e.g., North-South or East-West). 





BALANCED GROWTH OF FIRMS 185 


(i) The functions a(x, y, a, 8, ¥) and ba, y, a, B) are con- 
tinuous for all (x, y, a, 8) in the set 


S={(z, y, a, )|%, 20, ya = 1, 


y, = 0, Du=1,0za2z0 and oz 20}, 


where o is a positive constant. 

(ii) For all (x, y, a, 8) in S the inequalities A(z, y, a, 8B, y)=0, 
B(x, y, a, 8) 20, and A(z, y, a, 8, 7) + Bx, y, a, B)>O hold. 

(iii) For all (x, y, a, 8) in S the inequalities v[B(x, y, a, 8)] >0 
and v[ B(x, y, a, 8) — cA(x, y, a, 8,Y)]<0 hold, where B and B—@A 
are considered as matrix games in which the maximizing player con- 
trols the rows and the minimizing player the columns, and v(B) 
and v(B — aA) are the values of each of these games. 

Assumption (i) is a continuity assumption that prevents a,, and ,, 
from changing discontinuously from one point (x, y, a, 8) to another, 
while Assumption (ii) is like the original von Neumann assumption in 
[7]. As for (iii), v[B(z, y, a, 8) — cA(x, y, a, B, ¥)] < 0 merges into 
v[— A(z, y, a, 8, y)] <0 if o is sufficiently large. v[B(x, y, a, 8)| > 0 
and v[—A(z, y, a, 8, y)| <0 are like the assumptions made in [2}. 

In order to interpret the problem in game-theoretic terms we will 
need the following lemma: 


LEMMA 1. If az, y, a and 8 are solutions of (1)-(5), then a= 8B. 


PROOF: From (5) we see that «B(x, y, a, Bjy > 0; hence from (2) 
and (4) we have xB(z, y, a, By = axA(a, y, a, B, Y\y = BxA(a, y, 
a, 8, Yyy> 90. From the last relation we have xA(z, y, a, B, yy >90 
so that a=8=2B(z, y, a, B)jy/xA(x, y, a, B, Vy. 


The lemma tells us that we need look only for solutions in which 
the interest factor is equal to the growth factor. Under the assump- 
tion a=, equations (2) and (4) become identical and, in fact, are 
easy consequences of expressions (1) and (3) and the fact that x and 
y are non-negative vectors with unit sums. Writing B(z, y, a) = 
B(x, y, &, a) and A(x, y, a, ¥)= A(z, y, a, @, y) and making the 
abbreviation M,(z, y, a, 7) = Biz, y, &) — aA(z, y, a, ¥), We may 
write expressions (1), (3) and (5) as: 


(1’) Mz, y, & Y) 20, 
(3’) Mz, y¥, % ywso, 
(5’) cB(z, y, ay>o. 
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These three expressions do not define a game, and it is not immediately 
clear how they should be interpreted. If, however, we let matrices 
A and B assume constant values, we can interpret this as an ordinary 
expanding economy model. In order to do this we choose x*, y*, and 


a* such that z* 20, > at =1, v3 =9, ual and g =a* 20. 
t-1 « 


Consider the following expressions: 


(1”) aM(x*, y*, a*, yy 20, 
(3”) M(x", y*, a*, ys, 
(5”) eB(x*, y*, a*y>O0, 


where M,(x*, y*, a*, 7) = B(x*, y*, a*) — aA(x*, y*, a*, 7). Because 
of Assumptions (ii) and (iii) and the theorems in [2], [7], and [10], 
there is always a solution to these inequalities; moreover, the growth 
factor a is uniquely determined.” Let a** be the unique solution; 
then we obtain 


LEMMA 2. a** is positive and less than o and depends continuously 
on the variables (x*, y*, a*). 


Proor: Let (uz, we, 7, A¢) be any sequence of points in S that 
converge on the point (x*, y*, a*, a*) and let \j* be the correspond- 
ing growth factors. Since the entries A and B are non-negative, 
v(M,,) is a monotonic decreasing function of \». By Assumption (iii) 
e*’s are positive and less than g; thus Aj*’s are contained in a 
bounded portion of the real line. Hence we can pick a convergent 
subsequence X** with limit \**. The continuity assumption (i) implies 
that lim A(ut, wt, AF, v) = A(x*, y*, a*, y) and lim Biu*, wt, »F) = 


B(z*, y*, a*). Since o[M,-(ut, wt, 7, ¥)] = 9 for every t implies 
v[M,..(z*, y*, a*, y)] =0, we see that »,** = a**; otherwise, the 
uniqueness of the growth factor [insured by Assumption (ii)] would be 
violated. If the sequence \;* itself does not converge, then we can 
pick two different convergent subsequences with different limits which 
would also violate uniqueness; hence lim »3* = a**, and a** depends 


continuously on the variables (x*, y*, a*). Of course, 0 < a** <a. 
Let us substitute a** for a in (1”), (3”) and (5”), obtaining 


W If the matrices A and B have constant entries and Ma= B-caA satisfies the Assump- 
tions (ii) and (iii) above, then by the results in [2] and [10] there is a solution to ine- 
qualities (1’’), (3’’) and (5’’). Such a solution consists of an expansion factor a and a 
pair of vectors z and y that satisfy these inequalities. Finally, the result in [7] shows 
that the expansion factor found is unique. 
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2 2M,Jjz*, y*, a*, 20, 
(3’”) M,..(x*, y"*, ar, yy Ss 0 ’ 
(5’”) xcB(x*, y*, a*y>O. 


Let T(x*, y*, a*) be the following set: — 
T(z*, y*, a*) = {(z, y)| expressions (1’”) and (3’”) hold} . 
We then obtain 


LEMMA 3. The mapping (x*, y*, a*)—(T(x*, y*, a*), a**) is an 
upper semi-continuous mapping of the set 


U={@, v, ain 20, Dm =1, 4,20, Dy, =1 and ozaz0} 
into itself. Moreover, T(x*, y*, a*) is convex. 


Proor: It follows from Lemma 2 that all the entries of the game 
M,,.{x*, y*, a*, ) depend continuously on the variables (x*, y*, a*). 
Since the set of optimal strategies for a game varies in an upper 
semi-continuous way with the entries of the game, (x*, y*, a*)— 
(T(x*, y*, a*), a**) is an upper semi-continuous mapping of S into 
itself. The convexity of T is obvious. 

We can now establish 


THEOREM 1. A solution exists for inequalities (1’), (3’) and (5’) if 
Assumptions (i), (ii) and (iii) are satisfied. 

Proor: By the Kakutani fixed point theorem [1] the mapping defined 
in Lemma 8 has a fixed point, say (x*, y*) e T(x*, y*, a*) and a* = a**, 
Then (x*, y*, a*) satisfies (1’) and (3’), so we have 


(6) x*[B(x*, y*, a*) — a*A(a*, y*, a*, yy* =0. 
By Assumption (ii) we also have 
(7) a*[B(x*, y*, a*) + A(x*, y*, a*, y)y* >0. 


Substituting from (6) into (7) we obtain 
(1 + a*)a*B(a*, y*, a*y*>0. 

Therefore, we conclude that «*B(x*, y*, a*)y* > 0, which shows that 
(5’) holds, and we have established the existence of a state of balanced 
growth. 

REMARK. So far we have assumed that a is finite. If @ is infinite, 
Assumptions (i), (ii) and (iii) are replaced by: 

(i’) The functions a,{z, y, a, 8, y) and bz, y, a, 8) are con- 
tinuous for all (x, y, a, 8) in the set 
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& = {(, y, a, 8)\a%, 20, ye, =1, 
y,29, Dv =1, a=0 and R20. 


These functions approach certain limits as both a and £ become 
infinite, and those limits are continuous for all pairs of non-negative 
vectors x and y with unit sums. 

(ii’) For all (x, y, a, 8) in S’ the inequalities A(x, y, a, 8, y) = 9, 
Bix, y, a, 8) 20, and A(z, y, a, B, ¥) + B(x, y, a, 8) > 0 hold, and 
lim [A(z, y, a, B, 7) + B(x, y, a, 8)| > 4. 

(iii) For all (x, yu, a, 8) in S’ the inequalities v| — A(x, y, a, 8, y)|<0 
and v[B(x, y, a, B)| > 0 hold, and lim w[—A(z, y, a, B, Y)| #0 and 
lim of B(x, y, a, 8)] # 0. 

@,B-o 
By the same method as above we can prove 


THEOREM 2. If Assumptions (i’), (ii’) and (iii’) hold, the system 
(1)-(5) has a solution (x*, y*, a*, 8*) such that (x*, y*, a*, B*)eS’ 
and a* = f* = finite. 


8. EXTERNAL ECONOMIES AND THE RATE OF BALANCED GROWTH 


In this section we are concerned with the effect of external economies 
upon the rate of balanced growth of firms. In the absence of any 
external economies, we have the simplest (independent) production 
functions: 

(8) F,[b,(t + 1), ete bialt + 1), ¢,,(t), - Ips Cin(t)] =0 (2 = 1, oie, m), 
assuming that there are constant returns to scale in each firm and 
that the production period is unity. In equilibrium the capitalized 
value of profits per worker, 
1 n ‘ n 
Yo wut + Dole +1) ]— Yo pithealt) — w(t), 

A(t)La=a j=1 
is at a maximum subject to the production function (8), so that, under 
the assumption of static expectations, 5,(t + 1) and ¢,,t) depend on 
y(t), and A(t). If we are restricted to the case of balanced growth 
in which prices and the rate of interest are constant over time, we 
obtain 


6, = 5fy, 8), 
C, =C,y, 8). 


Let us write: a, fy, B, Y) = ey, 8) + yk, A(y, B, ¥) = I! ay, B, vil, 
and B(y, 8) = || 5,{y, 8)\|. The expressions corresponding to (1)}-(5) 
in Section 1 now become 





BALANCED GROWTH OF FIRMS 189 


(9) x[B(y, 8) — aAly, 8, y)]) 20, 
(10) x[Biy, 8) — aAty, B&, y)\ly=9, 
(11) [Biy, 8) — BAYy, B, yy sO, 
(12) 2[Bly, 8) — BA(y, 8, Yly =0, 
(13) «Bly, By >. 


Under conditions similar to (i)-(iii) all of the work in Section 2 holds, 
and the existence theorem insures that the system (9)-(12) has at least 
one economic solution (Z, y, @, #). 

It is plausible enough to assume that any factor-product combina- 
tion, feasible in the absence of external economies, is still feasible 
under external economies, but no longer efiicient. Under this assump- 
tion we can obtain 


THEOREM 3. The presence of external economies gives rise to an 
increase in the rate of balanced growth. 


PRooF: Let 2’, y’, a, and f° be the solution of (1)-(5). The profits 
with the chosen output- and input-coefficients, 5,,(z°, y’, a’, 8°) and 
c,(x°, y’, a’, 8°), when normalized prices are y’ and the interest factor is 
8°, should be greater than the profits which could have been made had 
any other feasible, but inefficient, combination been chosen, so that 


LE > bY, By} ~ Do eild, Ay’ - & 
8° 3-1 jai 
(14) - 
< = DD b,(2’, y’, a’, By} ~ >» C(x’, y’, a, Sys =—@, 


This follows, since the combination 
[b.,(y, B), mn Oe bin(¥, B), ie C(Y, 8), sa Cinl(Y, R)] 
is still feasible, but not efficient, when there are external economies. 
Together with >> vk,y} = w, (14) leads to: 
jel 
< Bix’, y’, a, By’ — BA(x’, y’, a, By’. 


From (3) it follows that the right-hand side of (15) does not exceed 
zero. Hence 


By, By < BAY, B, vy. 
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Multiply this by Z, obtaining ZB(9, A)y’ < f°2 A(¥,A, v)y’; and multiply 
(9) by y’, obtaining ZB(y, B)y = AZA(y, B, vy’; these two expressions 
imply that 6° >&. Since 6° = a’, we get a* > &, completing the proof. 


4. DETERMINATION OF THE CONSUMPTION LEVEL 


So far we have treated the consumption level y as a parameter and 
determined the 2’, y’, a and §° corresponding to the y. In this sec- 
tion we are concerned with two situations with the same rate of 
growth of population, in one of which there are external economies, 
while in the other they are absent; the consumption levels in these 
situations are compared with each other. 

We begin by establishing the following lemma: 


LemMMA 4. If the consumption level tends to infinity, the rate of 
balanced growth tends to —1. 


ProoFr: It follows from (2) that 


2B(z, y, a, By = axC(z, y, a, By t+ ra 3 Kes « 


Since (z, y, a, 8) belongs to the set S, and since B(z, y, a, 8) is 
continuous for all (z, y, a, 8) in S, xB(z, y, a, B)y does not exceed 
a properly chosen constant. On the other hand, 2zC(z, y, a, Ay is 
non-negative. Hence a must tend to zero as 7 tends to infinity. 


Let us now assume that the system (1)-(5) determines a unique rate 
@ corresponding to a given y. Then it is easily seen that the a isa 


a 














ce 


“h 
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continuous function of y. As Lemma 4 holds, the function a,j has a 
shape like curve aa in Figure 1. Curve a’a’ stands for the function 
a(y) in the absence of external economies. By virtue of Theorem 3, 
it must be located below aa. 

Suppose the supply of labor in period ¢ is p‘N, where the rate of 
growth of population o — 1 is assumed constant. Since the demand 


for labor in period ¢ is a‘ } q(0), the demand-supply balance of labor 
is described as: 


a » q(0)=p'N for all t, 
which is equivalent to the following two equations: 
¥ a0) = N 
a=p. 
If max a(y) =>, there exists at least one y such that a(y) = 9p, cf. 


Figure 1. But if maxa(y) <p, then even if >> 4,0) = N, we will 
t=1 
have 


ae . q{0) < oN fort=1. 


Hence unemployment of labor will be inevitable, and we would expect 
that the rate of real wages (and hence the consumption level y) will 
be pushed down to the subsistence level. (In any case, the consump- 
tion level cannot be determined within our model for this situation.) 
The balanced growth associated with the greatest y among those which 
make a@ and p equal may be called the golden balanced growth, and 
that y may be referred to as the highest allowable level of consump- 
tion or the golden level of consumption. The theorem which follows 
is immediately obvious from Figure 1. 


THEOREM 4. The highest allowable level of consumption in the 
presence of external economies is higher than that in their absence. 


Osaka University, Japan 
Carnegie Institute of Technology, U.S.A. 
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THE KEYNESIAN THEORY OF INFLATION: 
THE TWO FACES OF JANUS ?* 


By SIDNEY WEINTRAUB 


CONSIDERING THE high unanimity with which writers of the Keynesian 
stamp approach problems of total employment and aggregate output, 
it is somewhat odd and baffling that on the great practical problem 
of the decade now closing, specifically, the inflation issue, their views 
have split them into separate camps of ‘‘demand-pull’’ and ‘‘cost-push,”’ 
or some uneasy amalgam of the two, as with varying intensity they 
have examined the always partial and inconclusive empirical evidence. 
On the inflationary subject, I think that differences have tended to 
submerge likenesses, so that we have an unaccustomed array of op- 
ponents in the intellectual arena. 

A “school’’ in dispute is obviously an interesting spectacle and must 
expect to draw attention to its plight. At the same time, the split 
intellectual personality that I think I detect on this one great issue 
is undoubtedly capable of generating new contradictions over time on 
other problems of greater or lesser importance. 

In this paper I am not mainly concerned with the question of which 
of the two major inflation doctrines best fits the facts, or whether 
both should not be scrapped in favor of renewed emphasis on monetary 
factors and a Fisher-type equation of exchange. I have expressed 
myself elsewhere on this general subject and have little more to add 
to it at this time.’ What concerns me is a more fascinating theoretical, 
or methodological and philosophical matter, to wit, why economists who 
agree that the cure for unemployment is to use remedies stemming 
from the theory of the consumption function, the investment functicn, 
and the rate of interest, should find themselves at loggerheads and 
arrive at an impasse when it comes to an analysis of inflation and 
the price level. It is the origin of this Janus-head on inflation, the 
two voices from one source, that I should like to unravel in the 
following pages. Perhaps greater clarity on the theoretical plane will 
help sharpen the important issues with which the present generation 
of economists must learn to cope. 

* Manuscript received January 7, 1960. 

1 See my A General Theory of the Price Level, Output, Income Distribution and 
Economic Growth (Philadelphia: Chilton Company, 1959). 
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At the outset I should like to state firmly that I believe this 
‘muddle in the middle’’ is more of the making of Keynesians than of 
Keynes himself; I think I ‘‘know’’ what Keynes would have emphasized, 
but I refrain from any attempt at documentation, for it is equally 
possible for those who hold alternative views to find appropriate sup- 
porting passages in Keynes’ General Theory or iis later discussion 
in How to Pay for the War to sanction their doctrinal interpretation. 
If nothing else, this demonstration of different brands of Keynesianism, 
which have often been thought of as fully substitutable wares, discloses 
that there may be new rifts brewing under the quiet facade which 
may establish divergences in directions currently unsuspected. 

One further prefatory remark is in order. I have refrained from 
identifying proponents of the conflicting views and from identifying 
them with analytic error for several reasons. In the first place, some 
parts of the very imposing theoretical edifice have now become common 
property at the elementary and teaching level, so that, if I am right, 
the error is widespread. Second, much of this apparatus was con- 
structed without this special problem in view, so that some interpreta- 
tion is necessary; it is a hard matter to disentangle the price level 
analyses embodied in even the most rigorous versions of the theory. Third, 
it has happened that with what I regard as the natural apparatus of 
“‘excess-demand”’ and ‘‘demand-pull’’—using these terms synonymous- 
ly—some of the technicians may, and do, adopt a ‘‘cost-push’’ posture 
in approaching public policy, and vice versa. None of us are entirely 
free of such contradictions within our theoretical structures, and so 
more than the usual pains would have been required to make sure 
that the personifications were not unfair. In any event, it is the split 
personality in ideas among the school, not the ‘‘error’’ of individuals, 
that I want to explore.’ 


1. THE 45° LINE AND THE KEYNESIAN EQUILIBRIUM CROSS 


The dominant brand of Keynesian analysis, it can be said without 
fear of contradiction, runs in terms of the 45-degree line bisecting 
the coordinate field containing the familiar and assorted paraphernalia 
of consumption function and investment magnitude. I shall examine 
this diagram in a moment and, as it can be found in practically every 
modern-day textbook, the examination can be brief. It is this ap- 


2 Originally, as I began to document this paper, I found that the footnote discussion 
was far outrunning the textual material, and to little or no purpose. On reconsideration 
and in asking constantly whether the minor thumbnail analyses were fair to the authors 
| decided to present merely the major ideas. 
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paratus, with its associated real income equilibrium determination, 
which has been compared in importance with the demand and supply 
curves of price theory, and the equilibrium ‘‘cross’’ of the one identified 
as equivalent in principle to the equilibrium intersection of the other. 

Used somewhat less frequently, and emanating from Professor Hicks’ 
masterly exposition many years ago, are the pair of JS and LM curves 
which are supposed to give more generality to the argument by show- 
ing the simultaneous macroeconomic adjustment of aggregate output 
and the rate of interest.’ Both approaches—and both are really the 
same for my present purposes, for each rests on the same real output 
basis—represent the main wing of neo-Keynesian analysis and are 
equally regarded as Keynesian economics, pure and simple or profound 
and unadulterated. They have come to occupy a complete teaching 
tradition, at least in this country, and perhaps through the world, in 
view of the world-wide dissemination of American textbooks. The 
ideas may well constitute a fairly universal version of Keynesian 
economics. 

In Figure 1 there is drawn the diagram of the 45° cross which 
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® Professor Hansen has probably taught us most about this approach which still remains 
above the elementary textbook level. See his Monetary Theory and Fiscal Policy (New 
York: McGraw-Hill Book Co., Inc., 1949), Chapter 5. 
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typifies this tradition. The 45° bisector represents the potential 
equilibrium relationships in a diagram in which real income or real 
output, measured horizontally and denoted by Y, is associated with 
real outlay in the form of consumption expenditures (C) and invest- 
ment outlay (7), also in real terms, and measured vertically. Note 
that all quantities are in real terms; that is, C, J, and Y are money 
sums measured in constant dollars. Changing price level phenomena 
are thus completely eliminated from the picture; real magnitudes alone 
provide the focus of attention. 

The diagram, as drawn, contains the now commenplace figure of the 
consumption function, lettered C*, and has superimposed on it a 
constant sum of real investment (as in the more typical expositions) 
to comprise the real outlay function (C* + J*). The equilibrium inter- 
section is at a volume of real output equal to Y*. Here Y* =C*+ /*, 
so that macroequilibrium prevails. 

Suppose that full employment output is located at position Y,. 
Clearly, an aggregate demand (C* + J*) is inadequate and real output 
at Y* is too low, carrying unemployment with it. Usually, the 
familiar gamut of measures of fiscal policy and monetary maneuvers 
are recommended to snap the economy out of the unemployment 
doldrums unless exogenous factors are believed to be in the offing to 
accomplish the same objective without deeper government intervention. 
In general, an equilibrium output level to the left of Y, is described 
as a deflationary situation of unemployment, ripe for government 
intervention and compensatory public policy of weak or strong degree 
in order to lift aggregate demand to (C, + I,). 

It is well that we follow the reasoning of the ‘‘stability analysis’’ 
by which Y* is described as the ‘‘equilibrium position.’’ Suppose that 
somehow actual output settled at Y,. Aggregate demand in real 
terms, it would be explained, would amount to D,Y,, an amount in 
excess of the real output produced (Y,Y!) by the real difference D,Y, 
minus Y,Y{. Would this mean that there would be some incipient 
‘‘inflation’’? 

I think the usual answer would be yes, that the D,Y, minus Y,Y{ 
gap would constitute ‘‘excess demand’’ and that this would be infla- 
tionary, so that the system would be driven to the Y* output level. 
The peculiar nature of this ‘‘inflation’’ should be observed. Jt is an 
inflation that does not have, or need not have, any price rise, inasmuch 
as all of the analysis is cast in real terms so that, presumably, the 
price level at Y, is the same as at Y*, at Y,, at Y;, ete. Price 
levels are excluded from the analysis, as being in some way incidental, 
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so that only through some back-door approach can they be brought 
in. The only valid interpretation of the disequilibrium position at Y, 
is that if output temporarily settled here in some ‘‘round’’ of produc- 
tion, there would be ‘‘inflation’’ in the sense of inventory decumulation 
at the constant price level, equal in amount to the D,Y{ discrepancy. 
The run-down of inventory holdings would constitute an expansionary 
phenomenon, and a production advance toward the equilibrium ‘‘cross’’ 
would be in order.‘ Likewise, at an output level of Y, there would be an 
‘‘unwarranted”’ or ‘‘unintended’”’ inventory accumulation of Y{D,, so that 
pressures would be apparent to drive real output back to Y*. Deflation- 
ary forces would thus start the output cutback and unemployment train. 

Other than the minor ‘‘inflation’’ analyzed as part of the stability 
analysis, the interesting question is how a truly inflationary process 
would be analyzed by Keynesians using this approach. I think that 
the majority of Keynesians would argue that in inflation what must 
be happening is a parametric rise in the C+ J curve to a position 
well above the ‘‘full employment equilibrium’’ function C, + J,, drawn 
as a dashed line in Figure 1, so that true ‘‘excess-demand’”’ and 
‘“‘demand-pull’’ materializes in the system, with demands for real 
output outrunning the capacity of the system to satisfy it. As output 
cannot be expanded once full employment is reached, they would argue 
that prices would have to give way. From an analysis of this nature 
the Keynesians would conclude that the necessary policy steps to avoid 
a more alarming price level outbreak would be to choke off ‘‘excess- 
demand”’ or the ‘‘demand-pull’’ by monetary and fiscal means, or 
some combination of the two. (Those who take a less serious view of 
the durability and buoyancy of the demand forces would probably 
countenance learning to live with ‘‘creeping-inflation,’’ as a minor 
price of full employment.) 

Where do money wage rates enter the picture? Unfortunately, these 
have often gone undiscussed. Money wages are assumed constant in 
the basic argument, and the aggregate demand forces, in real terms, 
have been accorded the center of the stage. Where money wages are 
brought in it is often merely to aver that these will not rise unless 
there is full employment and excess demand. If they do rise, the 
argument seems to lean to the view that their upswing merely supports 
the price level that has already jumped up because of ‘‘excess-demand.”’ 
Schematically, because it fits so badly into the formal apparatus which 


‘ It is surprising to learn that often there is no discussion of the ‘‘ stability conditions.” 
The stronger versions do run in the terms indicated, and properly so, of inventory ac- 
cumulation and decumulation at nonequilibrium positions. 
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runs in real terms, when this group is pressed on the subject, some 
rather gratuitous remarks on the effect of money wage changes on 
the distribution of income and hence on the level of aggregate demand 
are often tendered. But such effects are generally accorded rather 
meager attention. 

In sum, as inflation is a creature of excess-demand, there is no 
serious price level distortion until full employment is reached. Simi- 
larly, the money wage level may be disregarded or accepted as a 
datum. Typically, exogenous wage movements through union pressure 
and business acquiescence are ignored unti! full employment. And 
then, it is the demand level that is stressed and creates the illusion 
of the ‘‘cost-push,’’ as this school sees it. 

The more sophisticated version of Keynesianism of this same stamp, 
with claims to more generality and insight, seeks to show the simul- 
taneous determination of interest rate and real income level through 
the interdependence of the self-same functions. Thus in Figure 2 
there is the well-known JS curve, which is built out of the real in- 
vestment response to interest rate changes, the consumption function, 
and the C + I identity, and is associated with various levels of real 
income or real output measured along the horizontal axis. In deriving 
TS it is the interest rate which is assumed to be the parametric 
constant, variations in which affect investment and thus the JS path. 
The LM curve is erected out of the remaining equations of Keynesianism, 
namely, the liquidity function and the money supply, regarded or- 
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dinarily asa datum. In deriving the LM path it is the income level— 
real income’—which is temporarily viewed as the parameter. The 
diagrammatic apparatus thus permits the disclosure of the mutual 
income-interest rate values that are simultaneously equilibrating. In 
Figure 2 the equilibrium values are i* and Y*. Once again, if full 
employment real income is at Y,, achievement of this zenith will 
require parametric movements of the two functions, through consump- 
tion, investment, or interest rate developments. 

Until ‘‘full employment,’’ in neither of these versions of Keynesian- 
ism—which are regarded as equivalent for my purposes—is there any 
substantial concern with inflation, for all its concepts are cast in real 
terms. The income levels being talked about have no explosive price 
levels, nor even changing price levels, for all this is assumed away. 
Surely, there is no concern with money wage changes—the ‘‘cost- 
push’’—for the very application of the technique precludes this dis- 
cussion: it is competent to handle variations in real phenomena, not 
in monetary phenomena. There is no way in which it can fit in, say, 
added monopoly power in the sense of discussions of advancing levels 
of administered prices where these are deemed important. For both 
changing monopoly power and changing money wages at full employ- 
ment or less have their impact on price levels rather than on real 
output. But these are the very problems which have been exorcised 
by this technique which runs in real terms. 

As one reflects on it, it becomes ever more astonishing that diagrams 
such as those examined have been regarded as useful in illuminating 
inflation when their very essence consists of conceptions running in 
real terms, thereby eliminating the very problem of changing price 
levels, whether due to new issues of money, or velocity changes, or 
money wage changes, or changes in productivity, or monopoly power 
—or anything else. This wing of Keynesianism, whose hallmark is 
either the analysis surrounding Figures 1 or 2, or other similar ver- 
sions running in real terms, has justifiably been charged as devoid of 
a basic explanation of inflation.’ I think this criticism is merited, for 


6 I have criticized the inclusion of the interest rate and the real—not money—income as 
the strategic variables in the liquidity function, and have suggested that the price level 
must also be included as a major parameter. See my Price Level, op. cit., p. 72. I 
have been surprised at certain protests to the effect that the price level has always been 
included in the money-demand equation. I submit that an examination of the literature 
does not bear this out though in common discussion economists will invariably acknowledge 
its importance. 

® Richard T. Selden, ‘‘Cost-Push versus Demand-Pull Inflation, 1955-57," Journal of 
Political Economy, (February, 1957), p. 5n. 
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in the last two decades, since 1940, the economy can be viewed as 
operating under conditions of strong aggregate demand, so that levels 
of real demand have not been in issue, and yet the numéraire pheno- 
menon of changing money wage levels which fit uncomfortably in a 
‘‘real’’ scheme has been barging upwards. It is little wonder that 
analyses springing from the Equation of Exchange have become more 
acceptable today than in the immediate heyday of Keynesianism. 

Lest it be thought that I am dealing only with straw men picked 
only from the softer geometrical treatments presented to the neophyte 
students of economics, an examination of some mathematical models 
of the same stripe and in the recognized sources discloses exactly the 
same arguments; there is no inconsistency between mathematical and 
geometrical versions on this score, although I do not rule out the 
possibility that more elaborate Keynesian modcls may deal more ade- 
quately with price level phenomena. In those systems that take 
enough pains to include price level analyses rather than working with 
real phenomena right from the start, the most common hypothesis is 
that the money wage level is constant, with the usual reservation 
that this holds only until ‘‘full employment’’ obtains. What happens 
to the price level along the course of the associated C, J, or savings 
functions until full employment is reached is literally never discussed 
in these accounts, according to my experience in examining the standard 
works. The typical assumption seems to be that prices hold constant, 
or variations are so small that they may be neglected, until full 
employment is reached. Unless this proviso is mentally inserted, 
there is no simple way of interpreting the geometrical figures like 
Figures 1 and 2 that usually accompany the equational statements. 
One less frequently quoted equational system makes the money wage 
a function of the level of employment. Apart from the problems 
(undiscussed in the original) with respect to ‘‘money illusion,” this 
simple function makes the point better than I would expect, for it 
leads inevitably to the argument that the high levels of money wages 
occur only in the vicinity of full employment: it would have no way 
of explaining why, in recent years, with little or no change in employ- 
ment levels the money wage keeps mounting. 

Thus I think I have presented fairly the dominant ideas of ‘‘demand- 
pull’’ Keynesian analysis on the inflation issue by way of the formal, 
and now not so formidable, diagrammatic and mathematical apparatus. 
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2. AGGREGATE MONEY DEMAND AND AGGREGATE 
MONEY SUPPLY FUNCTIONS 


Let us now turn to the other face of Janus, to look instead at an 
alternate diagrammatic approach which, although not as commonly 
employed as the 45° line, still has a literature of itsown. As I have 
used this apparatus in several writings over the past few years, the 
exposition of the main points can be made briefly.’ The ‘‘cost-push’’ 
argument comes out of it quickly, effortlessly, and directly. 

In Figure 3 uncorrected money levels of actual and expected ex- 
penditure are measured vertically, and amounts of labor are measured 
horizontally. All labor is assumed to be homogeneous, initially at least; 
this is a simplification not worse (more acceptable in my view) than 
the use of homogeneous units of real output (Y) in the dominant 
Keynesian approach. The curve Z, as drawn, represents an aggregate 
supply function, and it is premised on the short-period (Keynesian) 
assumption of a given stock of capital equipment, and it also as- 
sumes all along its course that the money wage level is held constant: 
this last hypothesis is tantamount to Keynes’ analysis in terms of a 
given wage unit or money wage structure. As the curve is drawn, 
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T See my An Approach to the Theory of Income Distribution (Philadelphia: Chilton 
Company, 1958), Chapter 2. For references to the earlier literature on the Z-function, 
see p. 24. 
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rising at an increasing rate to the right, it reflects some implicit 
diminishing returns phenomena and thus some necessary rising prices 
(of moderate proportions) as employment advances. Each point on 
the Z-function is to be interpreted as signifying the volume of outlays 
expected by firms in the economy in the form of consumer expenditures 
and investment outlays, which will lead the enterprises to give the 
associated level of employment.’ Thus at an expected outlay level of 
Z, the employment that firms will create is N,, ete. Manifestly, by 
writing the concomitant production function we can tether each em- 
ployment level to a particular real output level, as N, to Y,, etc. 

On the same diagram we can draw the aggregate demand function 
(D). This shows the actual outlays, as distinct from the necessary 
or expected outlays along the Z-function, associated with each N-level 
and the implicit money wage payments to labor. Further, at each 
demand point, as at D,, precisely the same price level is built into 
this point as is embodied in the corresponding Z, point at the N, 
amount of labor. At employment N,, say, as prices rise because of 
the phenomena of diminishing returns while money wages hold stable, 
the higher price level implicit at point Z, is also subsumed in the 
associated D, expenditure sum. 

Consider what this means for the stability analysis or the examina- 
tion of nonequilibrium positions. At employment N,, for example, 
there will be excess demand; in the Walrasian sense of microeconomic 
markets, of goods wanted at the given level of prices outrunning the 
quantities being offered for sale at the same price scale. To the right 
of N,, as we get closer to equilibrium and as prices rise, the level of 
excess demand dwindles until it disappears entirely at the equilibrium 
employment level N*. To the right of N*, at the still higher price 
levels implicit at these points, excess supply rather than excess demand 
obtrudes. 

Price level assumptions, and changing price level phenomena, are 
thus always integral parts of this analysis: whereas Figures 1 and 2 
have exorcised price level concepts, Figure 3 keeps them always in 
mind and in the forefront of the discussion. In general, apart from 
technological change and/or a change in the volume of capital equipment, 
changes in money wages will be the primary reason for parametric 
shifts in both the Z and D functions. Thus whenever the money wage 


8 This would suggest that the Z-function should be decomposed into these components. 
But exactly the same observation may be made with respect to the 45° line approach. 
For an early recognition of this, see Dudley Dillard, The Economics of John Maynard 
Keynes (New York: Prentice-Hall, Inc., 1948), p. 35n. 
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changes, and whatever the reason for the change, from this diagram- 
matic apparatus it becomes instantly clear that: (a) the price level 
must change, and (b) the money income or gross national product (Z) 
in money terms must change, while (c) employment and real output 
may change only if the new intersection of D and Z functions occurs 
at a point other than above N*. If the employment level is affected 
by the money wage change—and Keynes himself was largely a skeptic 
on this point—then it should be possible to derive a theory of money 
wages by aid of this apparatus.’ 

I think this very brief exposition will suffice for my purpose. From 
the analysis surrounding Figure 3 it can immediately be argued that 
even if money wages were held constant there would still be price 
level movements of greater or lesser degree, due to the laws of returns 
(and changing degrees of monopoly power). More important for the 
great controversies of the last decade or so, the argument leads in- 
exorably to the conclusion that the prime mover in price level fluctua- 
tions, or the factor of overriding importance in discussions of inflation, 
is the wage level and incipient changes in it. This is the heart of 
the ‘‘cost-push’”’ thesis: it devolves naturally and inevitably from this 
diagrammatic approach in contrast to its submersion into utter obscurity 
in the 45° line story. Excess demand, or a rise in the D-function in 
Figure 3, when unaccompanied by a money wage chanye, is seen to 
have a much less significant impact on the price level, which then 
merely responds to the laws of return. Conversely, a rise in the 
money wage level either as a result of changing employment levels 
or because of sheer exogenous factors, such as the need of labor union 
leaders to meet the desires of their membership, is shown always to 
have a clear and direct price level incidence. 


8. THE BASIS AND POLICY CONSEQUENCES OF 
THE DIVERGENCE IN VIEWS 


It is well to refiect for a moment on how it is that the two 
Keynesian wings have come up with such divergent views on the 
inflation problem, with one stressing the ‘‘demand-pull’’ and the other 
the money wage level and the ‘‘cost-push.”’ 

For myself, I think that at the bottom of the difference in views 
there lies the choice of working units by the separate groupings, with 
the one thinking largely in terms of real output or money amounts 


* I originally adopted this apparatus for just this purpose, a theory of money wage 
determination. See my article, ‘‘A Macroeconomic Approach to the Theory of Wages,” 
American Economic Review, (December, 1956). 
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corrected for price level changes, while the other has regarded the 
money wage as the appropriate numéraire and the employment unit 
as the bed-rock of analysis. On this interpretation it is the difference 
in the choice of deflators that seems to have been responsible for the 
mischief. While we may agree with Professor Hansen in principle 
when he wrote that ‘‘fundamentally the matter [of the choice of units, 
of real output or money wage] is of no great consequence,’’” it seems 
to me that this very matter has led to a great practical cleavage with 
vast policy implications. For when one thinks of real output as basic, 
it is an easy step to envisage inflation as ensuing when real output 
can no longer be increased. On the other hand, when wage units are 
recognized as the numéraire, the value of the money wage is recognized 
as crucial in deciding the level of prices. 

One approach thus leans inevitably to policy recommenda ns designed 
to hold wage requests in check. The other school, the dominant 
Keynesian branch, argues in terms of containing the level of demand 
to rein in prices, and thereby money wages. But here is the crucial 
anomaly which must sometimes rend this school, and the conflicting 
groups, even further: how far must real demand be cut back to control 
inflation? Must actual unemployment be created? If so, how much? 
If unemployment is not undertaken to discipline labor, can destabiliz- 
ing wage movements be prevented? 

If unemployment is the answer to the inflaticn problem, then Key- 
nesianism as a social philosophy is dead, literally interred by Keynesians 
and, curiously, all in the name of the mentor. For to restrain inflation 
its ideas must be used—perverted, I think—to create, not abolish, 
unemployment. My own belief is that Lord Keynes’ spirit must some- 
where be stirring with more than a little discomfiture at the ultimate 
irony of this strange twist to his ideas. 

I think that this great antithesis has been recognized, though not 
in this form, by several who approach Keynesianism in terms of 
aggregate demand and ‘‘demand-pull,’’ and are chary about wage 
controls because they do not fit readily into their philosophical and 
analytical systems of thought. I refer to the great emphasis of this 
group on the role of economic growth as the answer to inflation, and 
their willingness to tolerate what is now fashionable to call ‘‘creeping 
inflation.”’ But growth, if it brings with it exorbitant push-ups in 


” Alvin H. Hansen, A Guide to Keynes (New York: McGraw-Hill Book Co., Inc., 1953), 
p. 44. In this point, as in so many others, Professor Hanson has written with a refreshing 
breadth and candor on a matter which most other writers since Keynes’ day have not even 
bothered to discuss. 
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wages, will disappoint their hopes for merely ‘‘creeping inflation,’’ 
and so the issue will ultimately have to be faced. After all, we have 
had tremendous growth since, say 1940, and yet, more than a doub- 
ling of the price level. Growth, thus, is hardly an answer to the in- 
flation problem. 

The only alternative to conscious and deliberate creation of unemploy- 
ment in an economy in which the forces of aggregate demand are 
running strong, and one which couid still preserve the analytical and 
philosophical system fashioned by Lord Keynes, would be some deliber- 
ate design and control of the money wage level, just as we have 
learned to use fiscal and monetary policy to control aggregate demand. 
Only this could permit the social philosophy and moderate policy 
intervention attitude of Keynes to survive in a world of reasonably 
full employment. Otherwise, Keynesianism would have to be inten- 
tionally reversed, with the inteilectual framework originally created 
to eradicate the curse of unemployment deliberately and perilously 
distorted, to be used as an analytical engine to foster unemployment, 
so that the other evil, permanent inflation as a way of life, is repressed 
in favor of more tolerable standards of price level stabilization. How 
to contain the money wage level while maintaining full employment, 
and without breeding unacceptable interferences in labor markets, 
comes close to being the most important policy issue of our day.” 


University of Pennsylvania, U.S.A. 


11 Among Keynesians who have given what I think to be the appropriate emphasis tc the 
money wage as the dominant element in inflation, | would cite Professor A. P. Lerner 
who has described ‘‘seller’s inflation’? in a number of papers (e. g., Hearings, Joint 
Economic Committee, May 1958, p. 128), T. Balogh, ‘‘Productivity and Inflation,’’ Oxford 
Economie Papers, (June, 1958), and Nicholas Kaldor, ‘Inflation and Economic Growth,” 
Economica, (November, 1959), especially pp. 292-296. 
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Die theoretische Nationalékonomie ist noch weit vom Status einer exakten Wissen- 
schaft entfernt. Die vorliegende Arbeit hat sich die Aufgabe gestellt, ein Funda- 
ment fiir eine exakte Nationalékonomie zu errichten. 

Die in friiheren Zeiten verschiedentlich aufgestellte Behauptung, daB die National- 
Skonomie nie eine exakte Wissenschaft werden kénne, wird wohl kaum mehr ernst 
genommen und es fehit nicht an Versuchen, eine derartige Theorie von verschie- 
denen Seiten in Angriff zu nehmen: Die ,,Okonometrie’‘ geht in analoger Rich- 
tung wie die exakten Naturwissenschaften vor, indem sie versucht, von den Er- 
scheinungen induktiv auf die Gesetze des Wirtschaftens zu schlieBen (wie der Autor 
einleitend ausfiihrt, kann dieser Methode nicht der gleiche Erfolg beschieden sein 
wie in den Naturwissenschaften, da in der Wirtschaft keine Versuche mit Varia- 
tion von nur wenigen GréGen angestellt werden kénnen); rein mathematische 
Theorien stellen Axiomensysteme auf —¢'ne Arbeit, die erst wertvoll wird, so- 
bald man wei8, welche Axiomensysteme der theoretischen Nationalékonomie adaquat 
sind —; die Wiener Schule schlieBlich zeigte den Weg bzw. den Ausgangspunkt, 
wie der Autor im ersten Abschnitt darlegt. 

Die unmittelbar evidenten Grundtatsachen des menschlichen Verhaltens werden als 
Grundlage der ganzen Theorie analysiert, und zwar wesentlich tiefer schirfend 
als yon den Begriindern und Nachfahren der Wiener und der Lausanner Schule. 
Davon ausgehend wird der Ablauf der Wirtschaft unter Verwendung einer még- 
lichst einfachen Mathematik untersucht, wobei die unvermeidlichen ideaiisierenden 
Voraussetzungen in vieler Hinsicht weniger einschrinkend sind als etwa bei der 
Lausanner Schule und ihren Nachfahren. Fiir eine Weiterentwicklung der Theorie 
unter schrittweiser Aufhebung dieser idealisierenden Voraussetzungen bildet eine 
Arbeit wie die vorliegende die unentbehrliche Grundlage, da nur so die begriff- 
lichen Zusammenhange erstmals geklart und angemessene wirtschaftliche Begriffe 
exakt definiert werden kénnen. 
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